Correlating causes and effects associated with network activity

ABSTRACT

Embodiments are directed to monitoring network traffic using a monitoring engine that monitors network traffic in networks to provide metrics. An inference engine may provide activity profiles based on portions of the network traffic where each activity profile includes features associated with the portions of network traffic. The inference engine may determine other activity profiles correlated with the activity profiles based on correlation models such that the determination of the other activity profiles occurs prior to monitoring an occurrence of other portions of the network traffic. The inference engine may modify monitoring actions of the monitoring engine based on the other activity profiles. The inference engine may provide reports based on the portions of the network traffic, the activity profiles, the other portions of the network traffic, or the other activity profiles.

TECHNICAL FIELD

The present invention relates generally to network monitoring, and more particularly, but not exclusively, to monitoring networks in a distributed network monitoring environment.

BACKGROUND

On most computer networks, bits of data arranged in bytes are packaged into collections of bytes called packets. These packets are generally communicated between computing devices over networks in a wired or wireless manner. A suite of communication protocols is typically employed to communicate between at least two endpoints over one or more networks. The protocols are typically layered on top of one another to form a protocol stack. One model for a network communication protocol stack is the Open Systems Interconnection (OSI) model, which defines seven layers of different protocols that cooperatively enable communication over a network. The OSI model layers are arranged in the following order: Physical (1), Data Link (2), Network (3), Transport (4), Session (5), Presentation (6), and Application (7).

Another model for a network communication protocol stack is the Internet Protocol (IP) model, which is also known as the Transmission Control Protocol/Internet Protocol (TCP/IP) model. The TCP/IP model is similar to the OSI model except that it defines four layers instead of seven. The TCP/IP model's four layers for network communication protocol are arranged in the following order: Link (1), Internet (2), Transport (3), and Application (4). To reduce the number of layers from four to seven, the TCP/IP model collapses the OSI model's Application, Presentation, and Session layers into its Application layer. Also, the OSI's Physical layer is either assumed or is collapsed into the TCP/IP model's Link layer. Although some communication protocols may be listed at different numbered or named layers of the TCP/IP model versus the OSI model, both of these models describe stacks that include basically the same protocols. For example, the TCP protocol is listed on the fourth layer of the OSI model and on the third layer of the TCP/IP model. To assess and troubleshoot communicated packets and protocols over a network, different types of network monitors can be employed. One type of network monitor, a “packet sniffer” may be employed to generally monitor and record packets of data as they are communicated over a network. Some packet sniffers can display data included in each packet and provide statistics regarding a monitored stream of packets. Also, some types of network monitors are referred to as “protocol analyzers” in part because they can provide additional analysis of monitored and recorded packets regarding a type of network, communication protocol, or application.

Generally, packet sniffers and protocol analyzers passively monitor network traffic without participating in the communication protocols. In some instances, they receive a copy of each packet on a particular network segment or VLAN from one or more members of the network segment. They may receive these packet copies through a port mirror on a managed Ethernet switch, e.g., a Switched Port Analyzer (SPAN) port, a Roving Analysis Port (RAP), or the like, or combinations thereof. Port mirroring enables analysis and debugging of network communications. Port mirroring can be performed for inbound or outbound traffic (or both) on single or multiple interfaces. In other instances, packet copies may be provided to the network monitors from a specialized network tap or from a software entity running on the client or server. In virtual environments, port mirroring may be performed on a virtual switch that is incorporated within the hypervisor.

In complex networks, network activity, such as, requests or responses directed to one device or application may be responsible for causing subsequent activity associated with other devices or applications in the network. Correlating activity that is associated with different devices may be difficult absent disadvantageous or intrusive monitoring mechanisms. Thus, it is with respect to these considerations and others that the present invention has been made.

BRIEF DESCRIPTION OF THE DRAWINGS

Non-limiting and non-exhaustive embodiments of the present innovations are described with reference to the following drawings. In the drawings, like reference numerals refer to like parts throughout the various figures unless otherwise specified. For a better understanding of the described innovations, reference will be made to the following Detailed Description of Various Embodiments, which is to be read in association with the accompanying drawings, wherein:

FIG. 1 illustrates a system environment in which various embodiments may be implemented;

FIG. 2 illustrates a schematic embodiment of a client computer;

FIG. 3 illustrates a schematic embodiment of a network computer;

FIG. 4 illustrates a logical architecture of a system for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments;

FIG. 5 illustrates a logical schematic of a system for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments;

FIG. 6 illustrates a logical representation of a network in accordance with at least one of the various embodiments;

FIG. 7 illustrates a logical representation of a portion of a device relation model in accordance with at least one of the various embodiments;

FIG. 8A illustrates a logical representation of a device relation model showing naïve relationships between the entities in accordance with the one or more embodiments;

FIG. 8B illustrates a logical representation of a device relation model showing informed relationships between the entities in accordance with the one or more embodiments;

FIG. 9A illustrates a logical representation of a device relation model showing relationships between the entities based on observed network connections in accordance with the one or more embodiments;

FIG. 9B illustrates a logical representation of a device relation model showing phantom edges that represent relationships between the entities in accordance with the one or more embodiments;

FIG. 10 illustrates a logical architecture of a network that includes entities in accordance with the one or more embodiments;

FIG. 11 illustrates a logical representation of a data structure for a device relation model that includes entities in accordance with the one or more embodiments;

FIG. 12 represents a logical representation of a system for transforming monitored network traffic into activity profile objects in accordance with one or more of the various embodiment;

FIG. 13A illustrates a logical representation of a system for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments;

FIG. 13B illustrates a logical representation of a system for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments;

FIG. 14 illustrates a logical schematic of a system for monitoring network traffic that may be associated with network activity that may be associated with one or more activity profiles;

FIG. 15A illustrates a logical schematic of requests and responses that may be associated with an entity in a monitored network in accordance with one or more of the various embodiments;

FIG. 15B illustrates a logical schematic of a data structure that may be used to capture information that may be used by an inference engine to determine correlations between different activity profiles in accordance with one or more of the various embodiments;

FIG. 15C illustrates a logical schematic of activity profile correlations that may be discovered by the inference engine based on activity profiles and the network traffic associated with the activity profiles;

FIG. 16 illustrates an overview flowchart of a process for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments;

FIG. 17 illustrates a flowchart of a process for matching network activity with activity profiles for correlating causes and effects associated with the network activity in accordance with one or more of the various embodiments;

FIG. 18 illustrates a flowchart of a process for providing activity profiles based on network activity in accordance with one or more of the various embodiments;

FIG. 19 illustrates a flowchart of a process for adapting or modifying NMC actions based on correlated activity profiles in accordance with one or more of the various embodiments;

FIG. 20 illustrates a flowchart of a process for providing correlation models based on activity profiles and network activity in accordance with one or more of the various embodiments.

DETAILED DESCRIPTION OF VARIOUS EMBODIMENTS

Various embodiments now will be described more fully hereinafter with reference to the accompanying drawings, which form a part hereof, and which show, by way of illustration, specific exemplary embodiments by which the invention may be practiced. The embodiments may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the embodiments to those skilled in the art. Among other things, the various embodiments may be methods, systems, media or devices. Accordingly, the various embodiments may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. The following detailed description is, therefore, not to be taken in a limiting sense.

Throughout the specification and claims, the following terms take the meanings explicitly associated herein, unless the context clearly dictates otherwise. The phrase “in one embodiment” as used herein does not necessarily refer to the same embodiment, though it may. Furthermore, the phrase “in another embodiment” as used herein does not necessarily refer to a different embodiment, although it may. Thus, as described below, various embodiments may be readily combined, without departing from the scope or spirit of the invention.

In addition, as used herein, the term “or” is an inclusive “or” operator, and is equivalent to the term “and/or,” unless the context clearly dictates otherwise. The term “based on” is not exclusive and allows for being based on additional factors not described, unless the context clearly dictates otherwise. In addition, throughout the specification, the meaning of “a,” “an,” and “the” include plural references. The meaning of “in” includes “in” and “on.”

For example embodiments, the following terms are also used herein according to the corresponding meaning, unless the context clearly dictates otherwise.

As used herein the term, “engine” refers to logic embodied in hardware or software instructions, which can be written in a programming language, such as C, C++, Objective-C, COBOL, Java™, PHP, Perl, JavaScript, Ruby, VBScript, Microsoft .NET™ languages such as C#, or the like. An engine may be compiled into executable programs or written in interpreted programming languages. Software engines may be callable from other engines or from themselves. Engines described herein refer to one or more logical modules that can be merged with other engines or applications, or can be divided into sub-engines. The engines can be stored in non-transitory computer-readable medium or computer storage device and be stored on and executed by one or more general purpose computers, thus creating a special purpose computer configured to provide the engine.

As used herein, the term “session” refers to a semi-permanent interactive packet interchange between two or more communicating endpoints, such as network devices. A session is set up or established at a certain point in time, and torn down at a later point in time. An established communication session may involve more than one message in each direction. A session may have stateful communication where at least one of the communicating network devices saves information about the session history to be able to communicate. A session may also provide stateless communication, where the communication consists of independent requests with responses between the endpoints. An established session is the basic requirement to perform a connection-oriented communication. A session also is the basic step to transmit in connectionless communication modes.

As used herein, the terms “network connection,” and “connection” refer to communication sessions with a semi-permanent connection for interactive packet interchange between two or more communicating endpoints, such as network devices. The connection may be established before application data is transferred, and where a stream of data is delivered in the same or different order than it was sent. The alternative to connection-oriented transmission is connectionless communication. For example, the datagram mode of communication used by the Internet Protocol (IP) and the Universal Datagram Protocol (UDP) may deliver packets out of order, since different packets may be routed independently and could be delivered over different paths. Packets associated with a TCP protocol connection may also be routed independently and could be delivered over different paths. However, for TCP connections the network communication system may provide the packets to application endpoints in the correct order.

Connection-oriented communication may be a packet-mode virtual circuit connection. For example, a transport layer virtual circuit protocol such as the TCP protocol can deliver packets of data in order although the lower layer switching is connectionless. A connection-oriented transport layer protocol such as TCP can also provide connection-oriented communications over connectionless communication. For example, if TCP is based on a connectionless network layer protocol (such as IP), this TCP/IP protocol can then achieve in-order delivery of a byte stream of data, by means of segment sequence numbering on the sender side, packet buffering and data packet reordering on the receiver side. Alternatively, the virtual circuit connection may be established in a datalink layer or network layer switching mode, where all data packets belonging to the same traffic stream are delivered over the same path, and traffic flows are identified by some connection identifier rather than by complete routing information, which enables fast hardware based switching.

As used herein, the terms “session flow” and “network flow” refer to one or more network packets or a stream of network packets that are communicated in a session that is established between at least two endpoints, such as two network devices. In one or more of the various embodiments, flows may be useful if one or more of the endpoints of a session may be behind a network traffic management device, such as a firewall, switch, router, load balancer, or the like. In one or more of the various embodiments, such flows may be used to ensure that the packets sent between the endpoints of a flow may be routed appropriately.

Typically, establishing a TCP based connection between endpoints begins with the execution of an initialization protocol and creates a single bi-directional flow between two endpoints, e.g., one direction of flow going from endpoint A to endpoint B, the other direction of the flow going from endpoint B to endpoint A, where each endpoint is at least identified by an IP address and a TCP port.

Also, some protocols or network applications may establish a separate flow for control information that enables management of at least one or more flows between two or more endpoints. Further, in some embodiments, network flows may be half-flows that may be unidirectional.

As used herein, the term “tuple” refers to a set of values that identify a source and destination of a network packet, which may, under some circumstances, be a part of a network connection. In one embodiment, a tuple may include a source Internet Protocol (IP) address, a destination IP address, a source port number, a destination port number, virtual LAN segment identifier (VLAN ID), tunnel identifier, routing interface identifier, physical interface identifier, or a protocol identifier. Tuples may be used to identify network flows (e.g., connection flows).

As used herein the term “related flows,” or “related network flows” as used herein are network flows that while separate they are operating cooperatively. For example, some protocols, such as, FTP, SIP, RTP, VOIP, custom protocols, or the like, may provide control communication over one network flow and data communication over other network flows. Further, configuration rules may define one or more criteria that are used to recognize that two or more network flows should be considered related flows. For example, configuration rules may define that flows containing a particular field value should be grouped with other flows having the same field value, such as, a cookie value, or the like.

As used herein, the terms “network monitor”, “network monitoring computer”, or “NMC” refer to an application (software, hardware, or some combination) that is arranged to monitor and record flows of packets in a session that are communicated between at least two endpoints over at least one network. The NMC can provide information for assessing different aspects of these monitored flows. In one or more embodiment, the NMC may passively monitor network packet traffic without participating in the communication protocols. This monitoring may be performed for a variety of reasons, including troubleshooting and proactive remediation, end-user experience monitoring, SLA monitoring, capacity planning, application lifecycle management, infrastructure change management, infrastructure optimization, business intelligence, security, and regulatory compliance. The NMC can receive network communication for monitoring through a variety of means including network taps, wireless receivers, port mirrors or directed tunnels from network switches, clients or servers including the endpoints themselves, or other infrastructure devices. In at least some of the various embodiments, the NMC may receive a copy of each packet on a particular network segment or virtual local area network (VLAN). Also, for at least some of the various embodiments, they may receive these packet copies through a port mirror on a managed Ethernet switch, e.g., a Switched Port Analyzer (SPAN) port, a Roving Analysis Port (RAP), or the like, or combination thereof. Port mirroring enables analysis and debugging of network communications. Port mirroring can be performed for inbound or outbound traffic (or both) on single or multiple interfaces.

The NMC may track network connections from and to end points such as a client or a server. The NMC may also extract information from the packets including protocol information at various layers of the communication protocol stack. The NMC may reassemble or reconstruct the stream of data exchanged between the endpoints. The NMC may perform decryption of the payload at various layers of the protocol stack. The NMC may passively monitor the network traffic or it may participate in the protocols as a proxy. The NMC may attempt to classify the network traffic according to communication protocols that are used.

The NMC may also perform one or more actions for classifying protocols that may be a necessary precondition for application classification. While some protocols run on well-known ports, others do not. Thus, even if there is traffic on a well-known port, it is not necessarily the protocol generally understood to be assigned to that port. As a result, the NMC may perform protocol classification using one or more techniques, such as, signature matching, statistical analysis, traffic analysis, and other heuristics. In some cases, the NMC may use adaptive protocol classification techniques where information used to classify the protocols may be accumulated or applied over time to further classify the observed protocols. In some embodiments, NMCs may be arranged to employ stateful analysis. Accordingly, for each supported protocols, an NMC may use network packet payload data to drive a state machine that mimics the protocol state changes in the client/server flows being monitored. The NMC may categorize the traffic where categories might include file transfers, streaming audio, streaming video, database access, interactive, gaming, and the like. The NMC may attempt to determine whether the traffic corresponds to known communications protocols, such as HTTP, FTP, SMTP, RTP, TDS, TCP, IP, and the like.

In one or more of the various embodiments, NMCs or NMC functionality may be implemented using hardware or software based proxy devices that may be arranged to intercept network traffic in the monitored networks.

As used herein, the terms “layer” and “model layer” refer to a layer of one or more communication protocols in a stack of communication protocol layers that are defined by a model, such as the OSI model and the TCP/IP (IP) model. The OSI model defines seven layers and the TCP/IP model defines four layers of communication protocols.

For example, at the OSI model's lowest or first layer (Physical), streams of electrical/light/radio impulses (bits) are communicated between computing devices over some type of media, such as cables, network interface cards, radio wave transmitters, and the like. At the next or second layer (Data Link), bits are encoded into packets and packets are also decoded into bits. The Data Link layer also has two sub-layers, the Media Access Control (MAC) sub-layer and the Logical Link Control (LLC) sub-layer. The MAC sub-layer controls how a computing device gains access to the data and permission to transmit it. The LLC sub-layer controls frame synchronization, flow control and error checking. At the third layer (Network), logical paths are created, known as virtual circuits, to communicated data from node to node. Routing, forwarding, addressing, internetworking, error handling, congestion control, and packet sequencing are functions of the Network layer. At the fourth layer (Transport), transparent transfer of data between end computing devices, or hosts, is provided. The Transport layer is responsible for end to end recovery and flow control to ensure complete data transfer over the network.

At the fifth layer (Session) of the OSI model, connections between applications are established, managed, and terminated. The Session layer sets up, coordinates, and terminates conversations, exchanges, and dialogues between applications at each end of a connection. At the sixth layer (Presentation), independence from differences in data representation, e.g., encryption, is provided by translating from application to network format and vice versa. Generally, the Presentation layer transforms data into the form that the protocols at the Application layer (7) can accept. For example, the Presentation layer generally handles the formatting and encrypting/decrypting of data that is communicated across a network.

At the top or seventh layer (Application) of the OSI model, application and end user processes are supported. For example, communication partners may be identified, quality of service can be identified, user authentication and privacy may be considered, and constraints on data syntax can be identified. Generally, the Application layer provides services for file transfer, messaging, and displaying data. Protocols at the Application layer include FTP, HTTP, and Telnet.

To reduce the number of layers from seven to four, the TCP/IP model collapses the OSI model's Application, Presentation, and Session layers into its Application layer. Also, the OSI's Physical layer is either assumed or may be collapsed into the TCP/IP model's Link layer. Although some communication protocols may be listed at different numbered or named layers of the TCP/IP model versus the OSI model, both of these models describe stacks that include basically the same protocols.

As used herein, the term “entity” refers to an actor in the monitored network. Entities may include applications, services, programs, processes, network devices, network computers, client computers, or the like, operating in the monitored network. For example, individual entities may include, web clients, web servers, database clients, database servers, mobile app clients, payment processors, groupware clients, groupware services, or the like. In some cases, multiple entities may co-exist on or in the same network computer, process, application, compute container, or cloud compute instance.

As used herein, the term “device relation model” refers to a data structure that is used to represent relationships between and among different entities in a monitored network. Device relation models may be graph models comprised of nodes and edges stored in the memory of a network computer. In some embodiments, the network computer may automatically update the configuration and composition of the device relation model stored in the memory of the network computer to reflect the relationships between two or more entities in the monitored network. Nodes of the graph model may represent entities in the network and the edges of the graph model represent the relationship between entities in the network. Device relation models may improve the performance of computers at least by enabling a compact representation of entities and relationships in large networks to reduce memory requirements.

As used herein, the “device profile” refers to a data structure that represents the characteristics of network devices or entities that are discovered in networks monitored by NMCs. Values or fields in device profiles may be based on metrics, network traffic characteristics, network footprints, or the like, that have been collected based on passive network monitoring of network traffic in one or more monitored networks. Device profiles may be provided for various network devices, such as, client computers, server computers, application server computers, networked storage devices, routers, switches, firewalls, virtual machines, cloud instances, or the like.

As used herein, the “application profile” refers to a data structure that represents the characteristics of applications or services that are discovered in networks monitored by NMCs. Values or fields in application profiles may be based on metrics, network traffic characteristics, network footprints, or the like, that have been collected based on passive network monitoring of network traffic in one or more monitored networks. Application profiles may be provided for various applications, such as, client computers, server computers, application server computers, networked storage devices, routers, switches, firewalls, virtual machines, cloud instances, or the like. For example, application profiles may be provided for web clients, web servers, database clients, database servers, credentialing services, mobile application clients, payment processors, groupware clients, groupware services, micro-services, container based services, document management clients, document management services, billing/invoicing systems, building management services, healthcare management services, VOIP clients, VOIP servers, or the like.

As used herein, the term “entity profile” refers to a data structure that represent the characteristics of a network entity that may be a combination of device profiles and application profiles. Entity profiles may also include additional values or fields based on metrics, network traffic characteristics, network footprint, or the like, that have been collected based on passive network monitoring of network traffic in one or more monitored networks. For example, an entity profile may be provided for application servers where the entity profile is made from some or all of the device profile of the computer running or hosting the applications and some or all of the application profiles associated with the applications or services that are running or hosting one the computer. In some cases, multiple services or applications running on devices may be included in the same entity profile. In other cases, entity profiles may be arranged in hierarchal data structure similar to an object oriented computer languages class hierarchy.

As used herein, the terms “activity profile,” “request profile,” or “response profile” refer to data structures that that represent the characteristics of particular classes, types, or categorizations of network activity. This may include activity directed to devices or entities as well as activity generated by devices or entities. Activity profiles may include additional values or fields based on metrics, network traffic characteristics, activity content/traffic, or the like, that have been collected based on passive network monitoring of network traffic in one or more monitored networks. In some cases, activity profiles may be associated with one or more application profiles or entity profiles. Request profiles may be activity profiles that are associated with network activity associated with client requests provided to an entity. Similarly, response profiles may be associated with activity that may be provided to respond to a previously received request.

As used herein, the term “observation port” refers to network taps, wireless receivers, port mirrors or directed tunnels from network switches, clients or servers, virtual machines, cloud computing instances, other network infrastructure devices or processes, or the like, or combination thereof. Observation ports may provide a copy of each network packet included in wire traffic on a particular network segment or virtual local area network (VLAN). Also, for at least some of the various embodiments, observation ports may provide NMCs network packet copies through a port mirror on a managed Ethernet switch, e.g., a Switched Port Analyzer (SPAN) port, or a Roving Analysis Port (RAP).

The following briefly describes embodiments of the invention in order to provide a basic understanding of some aspects of the invention. This brief description is not intended as an extensive overview. It is not intended to identify key or critical elements, or to delineate or otherwise narrow the scope. Its purpose is merely to present some concepts in a simplified form as a prelude to the more detailed description that is presented later.

Briefly stated, various embodiments are directed to monitoring network traffic using one or more network computers. In one or more of the various embodiments, a monitoring engine may be instantiated to perform various actions. In one or more of the various embodiments, the monitoring engine may be arranged to monitor network traffic that may be associated with a plurality of entities in one or more networks to provide one or more metrics.

In one or more of the various embodiments, an inference engine may be instantiated to that performs various actions. In one or more of the various embodiments, the inference engine may be arranged to provide one or more activity profiles based on the plurality of entities and one or more portions of the network traffic such that each activity profile includes features based on the one or more metrics, the plurality of entities, or the one or more portions of network traffic. In one or more of the various embodiments, providing the one or more activity profiles may include matching the one or more portions of the network traffic to the one or more activity profiles based on one or more characteristics of the one or more portions network traffic such that the one or more characteristics include one or more of packet size, bit rate of the network traffic, communication protocol, encryption status, encryption state, encryption version, encryption type, encryption cipher, error rate, device relations based on one or more device relation models, time-of-day, day-of-week, user, user group, source tuple information, destination tuple information, application type, service type, payload or header content, geolocation information, or the like.

In one or more of the various embodiments, providing the one or more activity profiles may include providing computer readable instructions that may be executed to enforce one or more matching rules or filter rules that are associated with the one or more activity profiles such that the one or more matching rules or filter rules include one or more of regular expressions, one or more compound rules, one or more cascading rules, or one or more sub-rules.

In one or more of the various embodiments, the inference engine may be arranged to determine one or more other activity profiles that may correlate with the one or more activity profiles based on one or more correlation models such that the determination of the one or more other activity profiles may occur prior to monitoring an occurrence of one or more other portions of the network traffic that may be associated with the one or more other activity profiles.

In one or more of the various embodiments, the inference engine may be arranged to modify one or more actions of the monitoring engine based on the one or more other activity profiles. In one or more of the various embodiments, modifying the one or more actions of the monitoring engine may include modifying the monitoring of the network traffic based on the one or more other activity profiles such that the modifications include collecting one or more additional metrics associated with the one or more other portions of the network traffic that may be associated with the one or more other activity profiles.

In one or more of the various embodiments, the inference engine may be arranged to associate the one or more portions of the network traffic with the one or more activity profiles based on the one or more portions of the network traffic matching one or more of the features that may be associated with the one or more activity profiles.

In one or more of the various embodiments, the inference engine may be arranged to provide one or more reports based on the one or more portions of the network traffic, the one or more activity profiles, the one or more other portions of the network traffic, or the one or more other activity profiles, wherein the one or more reports are provided to one or more users.

In one or more of the various embodiments, the inference engine may be arranged to train the one or more correlation models based on the network traffic and the one or more activity profiles such that the one or more correlation models provide correlations that predict a likelihood of the occurrence of the one or more other portions of network traffic subsequent to the occurrence of the one or more portions of the network traffic based on one or more dependencies associated with the one or more activity profiles and the one or more other activity profiles.

In one or more of the various embodiments, the inference engine may be arranged to determine a score for each of the one or more correlation models based on the occurrence of the one or more other portions of network traffic subsequent to the one or more portions of the network traffic. In some embodiment, the inference engine may determine the one or more correlation models that may require re-training based on the determined score having a value that may be below a threshold value. And, in one or more of the various embodiments, the inference engine may be arranged to re-train the one or more determined correlation models based on the network traffic.

Illustrated Operating Environment

FIG. 1 shows components of one embodiment of an environment in which embodiments of the invention may be practiced. Not all of the components may be required to practice the invention, and variations in the arrangement and type of the components may be made without departing from the spirit or scope of the invention. As shown, system 100 of FIG. 1 includes local area networks (LANs)/wide area networks (WANs)—(network) 110, wireless network 108, client computers 102-105, application server computer 116, network monitoring computer 118, or the like.

At least one embodiment of client computers 102-105 is described in more detail below in conjunction with FIG. 2. In one embodiment, at least some of client computers 102-105 may operate over one or more wired or wireless networks, such as networks 108, or 110. Generally, client computers 102-105 may include virtually any computer capable of communicating over a network to send and receive information, perform various online activities, offline actions, or the like. In one embodiment, one or more of client computers 102-105 may be configured to operate within a business or other entity to perform a variety of services for the business or other entity. For example, client computers 102-105 may be configured to operate as a web server, firewall, client application, media player, mobile telephone, game console, desktop computer, or the like. However, client computers 102-105 are not constrained to these services and may also be employed, for example, as for end-user computing in other embodiments. It should be recognized that more or less client computers (as shown in FIG. 1) may be included within a system such as described herein, and embodiments are therefore not constrained by the number or type of client computers employed.

Computers that may operate as client computer 102 may include computers that typically connect using a wired or wireless communications medium such as personal computers, multiprocessor systems, microprocessor-based or programmable electronic devices, network PCs, or the like. In some embodiments, client computers 102-105 may include virtually any portable computer capable of connecting to another computer and receiving information such as, laptop computer 103, mobile computer 104, tablet computers 105, or the like. However, portable computers are not so limited and may also include other portable computers such as cellular telephones, display pagers, radio frequency (RF) devices, infrared (IR) devices, Personal Digital Assistants (PDAs), handheld computers, wearable computers, integrated devices combining one or more of the preceding computers, or the like. As such, client computers 102-105 typically range widely in terms of capabilities and features. Moreover, client computers 102-105 may access various computing applications, including a browser, or other web-based application.

A web-enabled client computer may include a browser application that is configured to send requests and receive responses over the web. The browser application may be configured to receive and display graphics, text, multimedia, and the like, employing virtually any web-based language. In one embodiment, the browser application is enabled to employ JavaScript, HyperText Markup Language (HTML), eXtensible Markup Language (XML), JavaScript Object Notation (JSON), Cascading Style Sheets (CS S), or the like, or combination thereof, to display and send a message. In one embodiment, a user of the client computer may employ the browser application to perform various activities over a network (online). However, another application may also be used to perform various online activities.

Client computers 102-105 also may include at least one other client application that is configured to receive or send content between another computer. The client application may include a capability to send or receive content, or the like. The client application may further provide information that identifies itself, including a type, capability, name, and the like. In one embodiment, client computers 102-105 may uniquely identify themselves through any of a variety of mechanisms, including an Internet Protocol (IP) address, a phone number, Mobile Identification Number (MIN), an electronic serial number (ESN), a client certificate, or other device identifier. Such information may be provided in one or more network packets, or the like, sent between other client computers, application server computer 116, network monitoring computer 118, or other computers.

Client computers 102-105 may further be configured to include a client application that enables an end-user to log into an end-user account that may be managed by another computer, such as application server computer 116, network monitoring computer 118, or the like. Such an end-user account, in one non-limiting example, may be configured to enable the end-user to manage one or more online activities, including in one non-limiting example, project management, software development, system administration, configuration management, search activities, social networking activities, browse various websites, communicate with other users, or the like. Further, client computers may be arranged to enable users to provide configuration information, policy information, or the like, to network monitoring computer 118. Also, client computers may be arranged to enable users to display reports, interactive user-interfaces, results provided by network monitor computer 118, or the like.

Wireless network 108 is configured to couple client computers 103-105 and its components with network 110. Wireless network 108 may include any of a variety of wireless sub-networks that may further overlay stand-alone ad-hoc networks, and the like, to provide an infrastructure-oriented connection for client computers 103-105. Such sub-networks may include mesh networks, Wireless LAN (WLAN) networks, cellular networks, and the like. In one embodiment, the system may include more than one wireless network.

Wireless network 108 may further include an autonomous system of terminals, gateways, routers, and the like connected by wireless radio links, and the like. These connectors may be configured to move freely and randomly and organize themselves arbitrarily, such that the topology of wireless network 108 may change rapidly.

Wireless network 108 may further employ a plurality of access technologies including 2nd (2G), 3rd (3G), 4th (4G) 5th (5G) generation radio access for cellular systems, WLAN, Wireless Router (WR) mesh, and the like. Access technologies such as 2G, 3G, 4G, 5G, and future access networks may enable wide area coverage for mobile computers, such as client computers 103-105 with various degrees of mobility. In one non-limiting example, wireless network 108 may enable a radio connection through a radio network access such as Global System for Mobil communication (GSM), General Packet Radio Services (GPRS), Enhanced Data GSM Environment (EDGE), code division multiple access (CDMA), time division multiple access (TDMA), Wideband Code Division Multiple Access (WCDMA), High Speed Downlink Packet Access (HSDPA), Long Term Evolution (LTE), and the like. In essence, wireless network 108 may include virtually any wireless communication mechanism by which information may travel between client computers 103-105 and another computer, network, a cloud-based network, a cloud instance, or the like.

Network 110 is configured to couple network computers with other computers, including, application server computer 116, network monitoring computer 118, client computers 102-105 through wireless network 108, or the like. Network 110 is enabled to employ any form of computer readable media for communicating information from one electronic device to another. Also, network 110 can include the Internet in addition to local area networks (LANs), wide area networks (WANs), direct connections, such as through a universal serial bus (USB) port, Ethernet port, other forms of computer-readable media, or any combination thereof. On an interconnected set of LANs, including those based on differing architectures and protocols, a router acts as a link between LANs, enabling messages to be sent from one to another. In addition, communication links within LANs typically include twisted wire pair or coaxial cable, while communication links between networks may utilize analog telephone lines, full or fractional dedicated digital lines including T1, T2, T3, and T4, or other carrier mechanisms including, for example, E-carriers, Integrated Services Digital Networks (ISDNs), Digital Subscriber Lines (DSLs), wireless links including satellite links, or other communications links known to those skilled in the art. Moreover, communication links may further employ any of a variety of digital signaling technologies, including without limit, for example, DS-0, DS-1, DS-2, DS-3, DS-4, OC-3, OC-12, OC-48, or the like. Furthermore, remote computers and other related electronic devices could be remotely connected to either LANs or WANs via a modem and temporary telephone link. In one embodiment, network 110 may be configured to transport information using one or more network protocols, such Internet Protocol (IP).

Additionally, communication media typically embodies computer readable instructions, data structures, program modules, or other transport mechanism and includes any information non-transitory delivery media or transitory delivery media. By way of example, communication media includes wired media such as twisted pair, coaxial cable, fiber optics, wave guides, and other wired media and wireless media such as acoustic, RF, infrared, and other wireless media.

One embodiment of application server computer 116 is described in more detail below in conjunction with FIG. 3. One embodiment of network monitoring computer 118 is described in more detail below in conjunction with FIG. 3. Although FIG. 1 illustrates application server computer 116, and network monitoring computer 118, each as a single computer, the innovations or embodiments are not so limited. For example, one or more functions of application server computer 116, network monitoring computer 118, or the like, may be distributed across one or more distinct network computers. Moreover, in one or more embodiment, network monitoring computer 118 may be implemented using a plurality of network computers. Further, in one or more of the various embodiments, application server computer 116, or network monitoring computer 118 may be implemented using one or more cloud instances in one or more cloud networks. Accordingly, these innovations and embodiments are not to be construed as being limited to a single environment, and other configurations, and other architectures are also envisaged.

Illustrative Client Computer

FIG. 2 shows one embodiment of client computer 200 that may include many more or less components than those shown. Client computer 200 may represent, for example, at least one embodiment of mobile computers or client computers shown in FIG. 1.

Client computer 200 may include processor 202 in communication with memory 204 via bus 228. Client computer 200 may also include power supply 230, network interface 232, audio interface 256, display 250, keypad 252, illuminator 254, video interface 242, input/output interface 238, haptic interface 264, global positioning systems (GPS) receiver 258, open air gesture interface 260, temperature interface 262, camera(s) 240, projector 246, pointing device interface 266, processor-readable stationary storage device 234, and processor-readable removable storage device 236. Client computer 200 may optionally communicate with a base station (not shown), or directly with another computer. And in one embodiment, although not shown, a gyroscope may be employed within client computer 200 for measuring or maintaining an orientation of client computer 200.

Power supply 230 may provide power to client computer 200. A rechargeable or non-rechargeable battery may be used to provide power. The power may also be provided by an external power source, such as an AC adapter or a powered docking cradle that supplements or recharges the battery.

Network interface 232 includes circuitry for coupling client computer 200 to one or more networks, and is constructed for use with one or more communication protocols and technologies including, but not limited to, protocols and technologies that implement any portion of the OSI model for mobile communication (GSM), CDMA, time division multiple access (TDMA), UDP, TCP/IP, SMS, MMS, GPRS, WAP, UWB, WiMax, SIP/RTP, GPRS, EDGE, WCDMA, LTE, UMTS, OFDM, CDMA2000, EV-DO, HSDPA, or any of a variety of other wireless communication protocols. Network interface 232 is sometimes known as a transceiver, transceiving device, or network interface card (MC).

Audio interface 256 may be arranged to produce and receive audio signals such as the sound of a human voice. For example, audio interface 256 may be coupled to a speaker and microphone (not shown) to enable telecommunication with others or generate an audio acknowledgement for some action. A microphone in audio interface 256 can also be used for input to or control of client computer 200, e.g., using voice recognition, detecting touch based on sound, and the like.

Display 250 may be a liquid crystal display (LCD), gas plasma, electronic ink, light emitting diode (LED), Organic LED (OLED) or any other type of light reflective or light transmissive display that can be used with a computer. Display 250 may also include a touch interface 244 arranged to receive input from an object such as a stylus or a digit from a human hand, and may use resistive, capacitive, surface acoustic wave (SAW), infrared, radar, or other technologies to sense touch or gestures.

Projector 246 may be a remote handheld projector or an integrated projector that is capable of projecting an image on a remote wall or any other reflective object such as a remote screen.

Video interface 242 may be arranged to capture video images, such as a still photo, a video segment, an infrared video, or the like. For example, video interface 242 may be coupled to a digital video camera, a web-camera, or the like. Video interface 242 may comprise a lens, an image sensor, and other electronics. Image sensors may include a complementary metal-oxide-semiconductor (CMOS) integrated circuit, charge-coupled device (CCD), or any other integrated circuit for sensing light.

Keypad 252 may comprise any input device arranged to receive input from a user. For example, keypad 252 may include a push button numeric dial, or a keyboard. Keypad 252 may also include command buttons that are associated with selecting and sending images.

Illuminator 254 may provide a status indication or provide light. Illuminator 254 may remain active for specific periods of time or in response to event messages. For example, when illuminator 254 is active, it may backlight the buttons on keypad 252 and stay on while the client computer is powered. Also, illuminator 254 may backlight these buttons in various patterns when particular actions are performed, such as dialing another client computer. Illuminator 254 may also cause light sources positioned within a transparent or translucent case of the client computer to illuminate in response to actions.

Further, client computer 200 may also comprise hardware security module (HSM) 268 for providing additional tamper resistant safeguards for generating, storing or using security/cryptographic information such as, keys, digital certificates, passwords, passphrases, two-factor authentication information, or the like. In some embodiments, hardware security module may be employed to support one or more standard public key infrastructures (PKI), and may be employed to generate, manage, or store keys pairs, or the like. In some embodiments, HSM 268 may be a stand-alone computer, in other cases, HSM 268 may be arranged as a hardware card that may be added to a client computer.

Client computer 200 may also comprise input/output interface 238 for communicating with external peripheral devices or other computers such as other client computers and network computers. The peripheral devices may include an audio headset, virtual reality headsets, display screen glasses, remote speaker system, remote speaker and microphone system, and the like. Input/output interface 238 can utilize one or more technologies, such as Universal Serial Bus (USB), Infrared, WiFi, WiMax, Bluetooth™, and the like.

Input/output interface 238 may also include one or more sensors for determining geolocation information (e.g., GPS), monitoring electrical power conditions (e.g., voltage sensors, current sensors, frequency sensors, and so on), monitoring weather (e.g., thermostats, barometers, anemometers, humidity detectors, precipitation scales, or the like), or the like. Sensors may be one or more hardware sensors that collect or measure data that is external to client computer 200.

Haptic interface 264 may be arranged to provide tactile feedback to a user of the client computer. For example, the haptic interface 264 may be employed to vibrate client computer 200 in a particular way when another user of a computer is calling. Temperature interface 262 may be used to provide a temperature measurement input or a temperature changing output to a user of client computer 200. Open air gesture interface 260 may sense physical gestures of a user of client computer 200, for example, by using single or stereo video cameras, radar, a gyroscopic sensor inside a computer held or worn by the user, or the like. Camera 240 may be used to track physical eye movements of a user of client computer 200.

GPS transceiver 258 can determine the physical coordinates of client computer 200 on the surface of the Earth, which typically outputs a location as latitude and longitude values. GPS transceiver 258 can also employ other geo-positioning mechanisms, including, but not limited to, triangulation, assisted GPS (AGPS), Enhanced Observed Time Difference (E-OTD), Cell Identifier (CI), Service Area Identifier (SAI), Enhanced Timing Advance (ETA), Base Station Subsystem (BSS), or the like, to further determine the physical location of client computer 200 on the surface of the Earth. It is understood that under different conditions, GPS transceiver 258 can determine a physical location for client computer 200. In one or more embodiment, however, client computer 200 may, through other components, provide other information that may be employed to determine a physical location of the client computer, including for example, a Media Access Control (MAC) address, IP address, and the like.

Human interface components can be peripheral devices that are physically separate from client computer 200, allowing for remote input or output to client computer 200. For example, information routed as described here through human interface components such as display 250 or keyboard 252 can instead be routed through network interface 232 to appropriate human interface components located remotely. Examples of human interface peripheral components that may be remote include, but are not limited to, audio devices, pointing devices, keypads, displays, cameras, projectors, and the like. These peripheral components may communicate over a Pico Network such as Bluetooth™, Zigbee™ and the like. One non-limiting example of a client computer with such peripheral human interface components is a wearable computer, which might include a remote pico projector along with one or more cameras that remotely communicate with a separately located client computer to sense a user's gestures toward portions of an image projected by the pico projector onto a reflected surface such as a wall or the user's hand.

A client computer may include web browser application 226 that is configured to receive and to send web pages, web-based messages, graphics, text, multimedia, and the like. The client computer's browser application may employ virtually any programming language, including a wireless application protocol messages (WAP), and the like. In one or more embodiment, the browser application is enabled to employ Handheld Device Markup Language (HDML), Wireless Markup Language (WML), WMLScript, JavaScript, Standard Generalized Markup Language (SGML), HyperText Markup Language (HTML), eXtensible Markup Language (XML), HTMLS, and the like.

Memory 204 may include RAM, ROM, or other types of memory. Memory 204 illustrates an example of computer-readable storage media (devices) for storage of information such as computer-readable instructions, data structures, program modules or other data. Memory 204 may store BIOS 208 for controlling low-level operation of client computer 200. The memory may also store operating system 206 for controlling the operation of client computer 200. It will be appreciated that this component may include a general-purpose operating system such as a version of UNIX, or LINUX™, or a specialized client computer communication operating system such as Windows Phone™, or the Symbian® operating system. The operating system may include, or interface with a Java virtual machine module that enables control of hardware components or operating system operations via Java application programs.

Memory 204 may further include one or more data storage 210, which can be utilized by client computer 200 to store, among other things, applications 220 or other data. For example, data storage 210 may also be employed to store information that describes various capabilities of client computer 200. The information may then be provided to another device or computer based on any of a variety of methods, including being sent as part of a header during a communication, sent upon request, or the like. Data storage 210 may also be employed to store social networking information including address books, buddy lists, aliases, user profile information, or the like. Data storage 210 may further include program code, data, algorithms, and the like, for use by a processor, such as processor 202 to execute and perform actions. In one embodiment, at least some of data storage 210 might also be stored on another component of client computer 200, including, but not limited to, non-transitory processor-readable removable storage device 236, processor-readable stationary storage device 234, or even external to the client computer.

Applications 220 may include computer executable instructions which, when executed by client computer 200, transmit, receive, or otherwise process instructions and data. Applications 220 may include, for example, other client applications 224, web browser 226, or the like. Client computers may be arranged to exchange communications, such as, queries, searches, messages, notification messages, event messages, alerts, performance metrics, log data, API calls, or the like, combination thereof, with application servers or network monitoring computers.

Other examples of application programs include calendars, search programs, email client applications, IM applications, SMS applications, Voice Over Internet Protocol (VOIP) applications, contact managers, task managers, transcoders, database programs, word processing programs, security applications, spreadsheet programs, games, search programs, and so forth.

Additionally, in one or more embodiments (not shown in the figures), client computer 200 may include one or more embedded logic hardware devices instead of CPUs, such as, an Application Specific Integrated Circuit (ASIC), Field Programmable Gate Array (FPGA), Programmable Array Logic (PAL), or the like, or combination thereof. The embedded logic hardware devices may directly execute embedded logic to perform actions. Also, in one or more embodiments (not shown in the figures), client computer 200 may include one or more hardware microcontrollers instead of CPUs. In one or more embodiments, the microcontrollers may directly execute their own embedded logic to perform actions and access their own internal memory and their own external Input and Output Interfaces (e.g., hardware pins or wireless transceivers) to perform actions, such as System On a Chip (SOC), or the like.

Illustrative Network Computer

FIG. 3 shows one embodiment of network computer 300 that may be included in a system implementing at least one of the various embodiments. Network computer 300 may include many more or less components than those shown in FIG. 3. However, the components shown are sufficient to disclose an illustrative embodiment for practicing these innovations. Network computer 300 may represent, for example, one embodiment of at least one of application server computer 116, or network monitoring computer 118 of FIG. 1.

As shown in the figure, network computer 300 includes a processor 302 that may be in communication with a memory 304 via a bus 328. In some embodiments, processor 302 may be comprised of one or more hardware processors, or one or more processor cores. In some cases, one or more of the one or more processors may be specialized processors designed to perform one or more specialized actions, such as, those described herein. Network computer 300 also includes a power supply 330, network interface 332, audio interface 356, display 350, keyboard 352, input/output interface 338, processor-readable stationary storage device 334, and processor-readable removable storage device 336. Power supply 330 provides power to network computer 300.

Network interface 332 includes circuitry for coupling network computer 300 to one or more networks, and is constructed for use with one or more communication protocols and technologies including, but not limited to, protocols and technologies that implement any portion of the Open Systems Interconnection model (OSI model), global system for mobile communication (GSM), code division multiple access (CDMA), time division multiple access (TDMA), user datagram protocol (UDP), transmission control protocol/Internet protocol (TCP/IP), Short Message Service (SMS), Multimedia Messaging Service (MMS), general packet radio service (GPRS), WAP, ultra-wide band (UWB), IEEE 802.16 Worldwide Interoperability for Microwave Access (WiMax), Session Initiation Protocol/Real-time Transport Protocol (SIP/RTP), or any of a variety of other wired and wireless communication protocols. Network interface 332 is sometimes known as a transceiver, transceiving device, or network interface card (NIC). Network computer 300 may optionally communicate with a base station (not shown), or directly with another computer.

Audio interface 356 is arranged to produce and receive audio signals such as the sound of a human voice. For example, audio interface 356 may be coupled to a speaker and microphone (not shown) to enable telecommunication with others or generate an audio acknowledgement for some action. A microphone in audio interface 356 can also be used for input to or control of network computer 300, for example, using voice recognition.

Display 350 may be a liquid crystal display (LCD), gas plasma, electronic ink, light emitting diode (LED), Organic LED (OLED) or any other type of light reflective or light transmissive display that can be used with a computer. In some embodiments, display 350 may be a handheld projector or pico projector capable of projecting an image on a wall or other object.

Network computer 300 may also comprise input/output interface 338 for communicating with external devices or computers not shown in FIG. 3. Input/output interface 338 can utilize one or more wired or wireless communication technologies, such as USB™, Firewire™, WiFi, WiMax, Thunderbolt™, Infrared, Bluetooth™, Zigbee™, serial port, parallel port, and the like.

Also, input/output interface 338 may also include one or more sensors for determining geolocation information (e.g., GPS), monitoring electrical power conditions (e.g., voltage sensors, current sensors, frequency sensors, and so on), monitoring weather (e.g., thermostats, barometers, anemometers, humidity detectors, precipitation scales, or the like), or the like. Sensors may be one or more hardware sensors that collect or measure data that is external to network computer 300. Human interface components can be physically separate from network computer 300, allowing for remote input or output to network computer 300. For example, information routed as described here through human interface components such as display 350 or keyboard 352 can instead be routed through the network interface 332 to appropriate human interface components located elsewhere on the network. Human interface components include any component that allows the computer to take input from, or send output to, a human user of a computer. Accordingly, pointing devices such as mice, styluses, track balls, or the like, may communicate through pointing device interface 358 to receive user input.

GPS transceiver 340 can determine the physical coordinates of network computer 300 on the surface of the Earth, which typically outputs a location as latitude and longitude values. GPS transceiver 340 can also employ other geo-positioning mechanisms, including, but not limited to, triangulation, assisted GPS (AGPS), Enhanced Observed Time Difference (E-OTD), Cell Identifier (CI), Service Area Identifier (SAI), Enhanced Timing Advance (ETA), Base Station Subsystem (BSS), or the like, to further determine the physical location of network computer 300 on the surface of the Earth. It is understood that under different conditions, GPS transceiver 340 can determine a physical location for network computer 300. In one or more embodiment, however, network computer 300 may, through other components, provide other information that may be employed to determine a physical location of the client computer, including for example, a Media Access Control (MAC) address, IP address, and the like.

In at least one of the various embodiments, applications, such as, operating system 306, network monitoring engine 322, inference engine 324, analysis engine 326, anomaly engine 327, web services 329, or the like, may be arranged to employ geo-location information to select one or more localization features, such as, time zones, languages, currencies, calendar formatting, or the like. Localization features may be used when interpreting network traffic, monitoring application protocols, user-interfaces, reports, as well as internal processes or databases. In at least one of the various embodiments, geo-location information used for selecting localization information may be provided by GPS 340. Also, in some embodiments, geolocation information may include information provided using one or more geolocation protocols over the networks, such as, wireless network 108 or network 111.

Memory 304 may include Random Access Memory (RAM), Read-Only Memory (ROM), or other types of memory. Memory 304 illustrates an example of computer-readable storage media (devices) for storage of information such as computer-readable instructions, data structures, program modules or other data. Memory 304 stores a basic input/output system (BIOS) 308 for controlling low-level operation of network computer 300. The memory also stores an operating system 306 for controlling the operation of network computer 300. It will be appreciated that this component may include a general-purpose operating system such as a version of UNIX, or LINUX™, or a specialized operating system such as Microsoft Corporation's Windows® operating system, or the Apple Corporation's IOS® operating system. The operating system may include, or interface with a Java virtual machine module that enables control of hardware components or operating system operations via Java application programs. Likewise, other runtime environments may be included.

Memory 304 may further include one or more data storage 310, which can be utilized by network computer 300 to store, among other things, applications 320 or other data. For example, data storage 310 may also be employed to store information that describes various capabilities of network computer 300. The information may then be provided to another device or computer based on any of a variety of methods, including being sent as part of a header during a communication, sent upon request, or the like. Data storage 310 may also be employed to store social networking information including address books, buddy lists, aliases, user profile information, or the like. Data storage 310 may further include program code, data, algorithms, and the like, for use by a processor, such as processor 302 to execute and perform actions such as those actions described below. In one embodiment, at least some of data storage 310 might also be stored on another component of network computer 300, including, but not limited to, non-transitory media inside processor-readable removable storage device 336, processor-readable stationary storage device 334, or any other computer-readable storage device within network computer 300, or even external to network computer 300. Data storage 310 may include, for example, activity profiles 312, network topology database 314, protocol information 316, or the like. activity profiles 312 may be a database arranged for storing activity profiles that are associated with network activity that may occur in monitored networks. Network topology database 314 may be a data store that contains information related to the topology of one or more network monitored by a NMC, including one or more device relation models. And, protocol information 316 may store various rules or configuration information related to one or more network communication protocols, including application protocols, secure communication protocols, client-server protocols, peer-to-peer protocols, shared file system protocols, protocol state machines, or the like, that may be employed for protocol analysis, entity auto-discovery, anomaly detections, or the like, in a monitored network environment.

Applications 320 may include computer executable instructions which, when executed by network computer 300, transmit, receive, or otherwise process messages (e.g., SMS, Multimedia Messaging Service (MMS), Instant Message (IM), email, or other messages), audio, video, and enable telecommunication with another user of another mobile computer. Other examples of application programs include calendars, search programs, email client applications, IM applications, SMS applications, Voice Over Internet Protocol (VOIP) applications, contact managers, task managers, transcoders, database programs, word processing programs, security applications, spreadsheet programs, games, search programs, and so forth. Applications 320 may include network monitoring engine 322, inference engine 324, analysis engine 326, anomaly engine 327, web services 329, or the like, that may be arranged to perform actions for embodiments described below. In one or more of the various embodiments, one or more of the applications may be implemented as modules or components of another application. Further, in one or more of the various embodiments, applications may be implemented as operating system extensions, modules, plugins, or the like.

Furthermore, in one or more of the various embodiments, network monitoring engine 322, inference engine 324, analysis engine 326, anomaly engine 327, web services 329, or the like, may be operative in a cloud-based computing environment. In one or more of the various embodiments, these applications, and others, that comprise the management platform may be executing within virtual machines or virtual servers that may be managed in a cloud-based based computing environment. In one or more of the various embodiments, in this context the applications may flow from one physical network computer within the cloud-based environment to another depending on performance and scaling considerations automatically managed by the cloud computing environment. Likewise, in one or more of the various embodiments, virtual machines or virtual servers dedicated to network monitoring engine 322, inference engine 324, analysis engine 326, anomaly engine 327, web services 329, or the like, may be provisioned and de-commissioned automatically.

Also, in one or more of the various embodiments, network monitoring engine 322, inference engine 324, analysis engine 326, anomaly engine 327, web services 329, or the like, may be located in virtual servers running in a cloud-based computing environment rather than being tied to one or more specific physical network computers. Likewise, in some embodiments, one or more of network monitoring engine 322, inference engine 324, analysis engine 326, anomaly engine 327, web services 329, or the like, may be configured to execute in a container-based environment.

Further, network computer 300 may also comprise hardware security module (HSM) 360 for providing additional tamper resistant safeguards for generating, storing or using security/cryptographic information such as, keys, digital certificates, passwords, passphrases, two-factor authentication information, or the like. In some embodiments, hardware security module may be employ to support one or more standard public key infrastructures (PKI), and may be employed to generate, manage, or store keys pairs, or the like. In some embodiments, HSM 360 may be a stand-alone network computer, in other cases, HSM 360 may be arranged as a hardware card that may be installed in a network computer.

Additionally, in one or more embodiments (not shown in the figures), network computer 300 may include one or more embedded logic hardware devices instead of CPUs, such as, an Application Specific Integrated Circuit (ASIC), Field Programmable Gate Array (FPGA), Programmable Array Logic (PAL), or the like, or combination thereof. The embedded logic hardware device may directly execute its embedded logic to perform actions. Also, in one or more embodiments (not shown in the figures), the network computer may include one or more hardware microcontrollers instead of CPUs. In one or more embodiments, the one or more microcontrollers may directly execute their own embedded logic to perform actions and access their own internal memory and their own external Input and Output Interfaces (e.g., hardware pins or wireless transceivers) to perform actions, such as System On a Chip (SOC), or the like.

Illustrative Logical System Architecture

FIG. 4 illustrates a logical architecture of system 400 for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments. System 400 may be arranged to include a plurality of network devices or network computers on first network 402 and a plurality of network devices or network computers on second network 404. Communication between the first network and the second network is managed by switch 406. Also, NMC 408 may be arranged to passively monitor or record packets (network packets) that are communicated in network flows between network devices or network computers on first network 402 and second network 404. For example, the communication of flows of packets between the Host B network computer and the Host A network computer are managed by switch 406 and NMC 408 may be passively monitoring and recording some or all of the network traffic comprising these flows.

NMC 408 may be arranged to receive network communication for monitoring through a variety of means including network taps, wireless receivers, port mirrors or directed tunnels from network switches, clients or servers including the endpoints themselves, virtual machine, cloud computing instances, other network infrastructure devices, or the like, or combination thereof. In at least some of the various embodiments, the NMC may receive a copy of each packet on a particular network segment or virtual local area network (VLAN). Also, for at least some of the various embodiments, NMCs may receive these packet copies through a port mirror on a managed Ethernet switch, e.g., a Switched Port Analyzer (SPAN) port, or a Roving Analysis Port (RAP). Port mirroring enables analysis and debugging of network communications. Port mirroring can be performed for inbound or outbound traffic (or both) on single or multiple interfaces. For example, in some embodiments, NMCs may be arranged to receive electronic signals over or via a physical hardware sensor that passively receives taps into the electronic signals that travel over the physical wires of one or more networks.

In one or more of the various embodiments, NMCs may be arranged to employ adaptive networking monitoring information including one or more device relation models that enable inference engines or analysis engines to correlate one or more causes and one or more effects associated with network activity of entities in the monitored networks. Also, in some embodiments, NMCs may be arranged to instantiate one or more network monitoring engines, one or more inference engines, one or more anomaly engines, or the like, to correlate one or more causes and one or more effects that may be associated with network activity.

FIG. 5 illustrates a logical schematic of system 500 for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments. In one or more of the various embodiments, an NMC, such as NMC 502 may be arranged to monitor network traffic in one or more networks, such as, network 504, network 506, or network 508. In this example, network 504, network 506, or network 508 may be considered similar to network 108 or network 110. Also, in some embodiments, one or more of network 504, network 506, or network 508 may be considered cloud computing environments. Likewise, in some embodiments, one or more of network 504, network 506, or network 508 may be considered remote data centers, local data centers, or the like, or combination thereof.

In one or more of the various embodiments, NMCs, such as NMC 502 may be arranged to communicate with one or more capture agents, such as, capture agent 512, capture agent 514, or capture agent 514. In some embodiments, capture agents may be arranged to selectively capture network traffic or collect network traffic metrics that may be provided to NMC 502 for additional analysis.

In one or more of the various embodiments, capture agents may be NMCs that are distributed in various networks or cloud environments. For example, in some embodiments, a simplified system may include one or more NMCs that also provide capture agent services. In some embodiments, capture agents may be NMCs arranged to instantiate one or more capture engines to perform one or more capture or collection actions. Similarly, in one or more of the various embodiments, one or more capture agents may be instantiated or hosted separately from one or more NMCs.

In one or more of the various embodiments, capture agents may be selectively installed such that may capture metrics for select portions of the monitored networks. Also, in some embodiments, in networks that have groups or clusters of the same or similar entities, capture agents may be selectively installed on one or more entities that may be representative of entire groups or clusters pf similar entities. Thus, in some embodiments, capture agents on the representative entities may collect metrics or traffic that may be used to infer the metrics or activity associated with similarly situated entities that do not include a capture agent.

Likewise, in one or more of the various embodiments, one or more capture agents may be installed or activated for a limited time period to collect information that may be used to infer activity information about the monitored networks. Accordingly, in one or more of the various embodiments, these one or more capture agents may be removed or de-activated if sufficient activity information or network traffic has been collected.

In one or more of the various embodiments, system 500 may include one or more network entities, such as, entities 518, entities 520, or the like, that communicate in or over one or more of the monitored networks. Entities 518 and entities 520 are illustrated here as cloud environment compute instances (e.g., virtual machines), or the like. However, one of ordinary skill in the art will appreciate that entities may be considered to be various network computers, network appliances, routers, applications, services, or the like, subject to network monitoring by one or more NMCs. (See, FIG. 4, as well).

In this example, for one or more of the various embodiments, capture agents, such as capture agent 512 may be arranged capture network traffic or network traffic metrics associated with one or more entities, such as, entities 518. Accordingly, in some embodiments, some or all of the information captured by capture agents may be provided to one or more NMCs, such as, NMC 502 for additional analysis. Also, in one or more of the various embodiments, capture agents or NMCs may be arranged to selectively store network traffic in a captured data store, such as, captured data store 522.

FIG. 6 illustrates a logical representation of network 600 in accordance with at least one of the various embodiments. In at least one of the various embodiments, network 602 represents a physical network and the entities in the network. In this example, network 602 includes, network computers 604, client computers 606, network devices, such as, network device 610, and other items, such as, Wi-Fi hotspot 608. One of ordinary skill in the art will appreciate that networks may have many more or different devices than shown in FIG. 6.

In at least one of the various embodiments, one or more network monitoring computers (NMCs) may be arranged to monitor networks, such as, network 602. (See, FIG. 4). In at least one of the various embodiments, NMCs may be arranged to generate one or more device relation models that represent the entities in a network. For example, device relation model 612 represents a device relation model corresponding to network 602. Accordingly, device relation model 612 includes nodes that represent the various entities that may be active in network 602. For example, entities 614, may represent some of the entities that are operative in network 602. In some embodiments, there may be more entities in model 612 than the number of actual computers and network devices present in network 602 since many network computers/devices may host more than one entity. For example, in some embodiments, a single network computer may host a web server and a database server. Accordingly, in this example, three entities may be included in the device relation model, one for the web server, one for the database server, and one for the network computer itself.

In this example, device relation model 612 shows nodes that correspond to entities absent any edges. In some embodiments, initially some or all of the relationships between the entities may be unknown to the monitoring NMC, so some or all of the edges may be unknown and therefor omitted from device relation model 612. Note, in at least one of the various embodiments, there may be pre-defined network architecture/topology information that may be available to the NMC. Accordingly, in some embodiments, the NMC may be able to determine some of the relationships between entities before observing network traffic.

FIG. 7 illustrates a logical representation of a portion of device relation model 700 in accordance with at least one of the various embodiments. In at least one of the various embodiments, device relation models may include nodes that represent entities and edges that represent relationships between the entities. In some embodiments, entities may represent servers, clients, switches, routers, NMCs, load balancers, applications, services, or the like. For example, entity 702 may be a server entity that has relationships with other servers, such as, entity 704 and entity 706. Likewise, entity 708 may be a server or other service that has a relationship with entity 704, entity 706, and entity 702. Further, entity 704 and entity 710 may have a relationship and client entities 712 may have direct relationships with entity 710.

In at least one of the various embodiments, NMCs may be arranged to use device relation model 700 to discover relationships between groups of entities. For example, device relation model 700 may be used to determine that entity 702, entity 704, 710, and client 712 may be in a related group because they are all on the same path through the graph.

In one or more of the various embodiments, one or more device relation models may be generated to represent different dimensions or concepts that may relate the one or more entities included in a model. For example, one device relation model may represent general dependencies among entities while another device relation model may be arranged to represent administration dependencies that show which entities may be arranged to administer other entities.

FIGS. 8A and 8B illustrate how a device relation model may evolve as the NMCs gather more information about the relationships between the entities in a network.

FIG. 8A illustrates a logical representation of device relation model 800 showing naïve relationships between the entities in accordance with the one or more embodiments. In at least one of the various embodiments, for example, a NMC may initially determine the entities in a network by observing the network traffic to obtain the source/destination network address fields in the network packets that flow through the network. In at least one of the various embodiments, each unique network address may represent a different entity in the network.

Likewise, in some embodiments, the NMC may be arranged to observe responses to broadcast messages, or the like. Additionally, in some embodiments, the NMC may be provided other configuration information (e.g., information provided by a configuration management database) that defines some or all of the entities in the network.

In this example, for at least one of the various embodiments, the NMC has discovered/identified six entities in the network (entity 802 through entity 812). Accordingly, in some embodiments, the NMC may be arranged to generate a device relation model, such as, device relation model 800 that represents the six discovered entities as nodes in the graph. Likewise, in some embodiments, edges in device relation model 800 may represent the initial relationships as determined by the NMC. For example, in the initial stages of monitoring a network the NMC may be arranged to determine relationships based on which entities are observed to be communicating with each other.

However, in at least one of the various embodiments, NMCs may be arranged to provide a device relation model that represents the relationships between the entities that go beyond simple interconnectivity. Initially, in some embodiments, the NMC may define the initial relationships in the network based on which entities communicate with each other. However, in at least one of the various embodiments, as the NMC collects more information about the entities and their relationships to other entities, the NMC may modify device relation model 800 to reflect the deeper understanding of these relationships.

FIG. 8B illustrates a logical representation of device relation model 800 showing informed relationships between the entities in accordance with the one or more embodiments. In at least one of the various embodiments, after sufficient monitoring has occurred, the NMC may have observed enough network traffic to evaluate and weight the relationships of the entities in the network.

In at least one of the various embodiments, some of the initial relationships may be determined to be incidental, spurious, or otherwise unimportant. Accordingly, the NMC may be arranged to remove (or de-prioritize) edges from device relation model 800 that correspond to such relationships. For example, in at least one of the various embodiments, entities (e.g., Windows network domain controllers) in a network may be arranged to periodically exchange messages with one or more other entities for discovery/accountability purposes. Thus, in this example, some of the messaging observed by an NMC may be routine and otherwise not resulting from an interesting relationships between the sender and receiver.

In at least one of the various embodiments, NMC may be arranged to evaluate the communication between entities to attempt to determine the type of relationships and the importance of the relationships. Accordingly, in at least one of the various embodiments, NMCs may be arranged to collected metrics associated with the various network flows flowing the network to identify the flows that may be important. Likewise, in at least one of the various embodiments, NMC may be arranged discover and recognize the communication protocols used by entities in monitored networks. In some embodiments, the NMCs may be arranged to use the collected metrics and its understanding of the communication protocol to establish or prioritize relationships between the entities in the networks.

In this example, for at least one of the various embodiments, device relation model 800 has been modified to include relationships determined to be of importance. The nodes representing entities 802-812 are still present in but some of the edges that represent relationships in the network have been removed. For example, in FIG. 8A, device relation model 800 includes an edge between entity 804 and entity 812. In FIG. 8B, device relation model 800 omits the edge between entity 804 and entity 812.

In at least one of the various embodiments, the remaining edges in device relation model 800 represent relationships between the entities that the NMC determined to be important for a given device relation model. Note, in at least one of the various embodiments, an NMC may employ a variety of metrics, conditions, heuristics, or the like, to identify relationships that may be of interest. For example, an NMC may be arranged to identify entities that represent certain applications on the network, such as, database servers, database clients, email servers, email clients, or the like, by identifying the communication protocols that may be used by the particular applications. In other cases, the NMC may determine an important relationship based on the number or rate of packets exchanged between one or more entities. Accordingly, the NMC may be configured to prioritize relationships between entities that exchange a high volume of traffic.

In at least one of the various embodiments, the NMC may analyze observed traffic to identify network packets that flow through particular paths in the device relation model. In some embodiments, NMCs may be arranged to trace or identify such paths connecting related entities by observing common data carried in the payloads or header fields of the network packets that are passed among entities in the network. For example, an NMC may be arranged to observe sequence numbers, session identifiers, HTTP cookies, query values, or the like, from all entities participating in transactions on the network. In some embodiments, the NMC may correlate observed network packets that may be requests and responses based on the contents of the network packets and known information about the operation of the underlying applications or protocols.

FIGS. 9A and 9B provide additional illustration of how a device relation model may evolve as the NMCs gather more information about the relationships between the entities in a network.

FIG. 9A illustrates a logical representation of device relation model 900 showing relationships between the entities based on observed network connections in accordance with the one or more embodiments. In at least one of the various embodiments, the NMC has provided device relation model 900 that represents the relationships between entity 902 through entity 912. Here device relation model 900 shows relationships that may be associated with actual network links (e.g., physical links or virtual links) between the entities in the network. For example, the edges in device relation model 900 may correspond to network flows that have been observed in the network. In some embodiments, an NMC may readily deduce these types of connection relationships by examining the source/destination fields in network packets observed in the network. Accordingly, in this example, entity 906 may have been observed exchanging data with entity 908 over the network.

FIG. 9B illustrates a logical representation of device relation model 900 showing phantom edges that represent relationships between the entities in accordance with the one or more embodiments. In some embodiments, networks may include entities that have important logical/operational relationships even though they do not exchange network packets directly with each other. Here, the NMC has discovered relationships between entity 902 and entity 908 even though they do not communicate directly with each other. Likewise, the NMC has discovered relationships between entity 904 and entity 912 even though they do not communicate directly with each other. Similarly, entity 908, entity 910, entity 912 have been found to be related even though there is no direct network link or direct communication between them.

In at least one of the various embodiments, the NMC may be arranged to represent such relationships using phantom edges. Phantom edges may represent relationships between entities that do not correspond to direct network links. For example, entity 902 and entity 904 may be database clients and entity 908, entity 910, and entity 912 may be database servers. In this example, entity 902 and entity 904 access the database servers through entity 906. In this example, entity 906 may be proxy-based load balancer of some kind. Accordingly, in this example there is no direct network link between the database clients and the database servers. Nor, as represented, do the database server entities (entity 908, entity 910, and entity 912) have direct connections to each other.

But, in some embodiments, the NMC may determine that the three database server entities (entity 908, entity 910, and entity 912) are related because they are each receiving communications from the same load balancer (entity 906). Likewise, the NMC may determine a relationship between the database clients (entity 902 and entity 904) and the database servers (entity 908, entity 910, and entity 912) by observing the operation of the database transactions even though they do not communicate directly with each other.

FIG. 10 illustrates a logical architecture of network 1000 that includes entities in accordance with the one or more embodiments. In at least one of the various embodiments, networks may include several (100s, 1000s, or more) computers or devices that may put network traffic on the network. As described above, (See, FIG. 4 and FIG. 5) network monitoring computers (NMCs) may be arranged to passively monitor the network traffic. In some embodiments, NMCs (not shown in FIG. 10) may have direct access to the wire traffic of the network enabling NMCs to access all of the network traffic in monitored networks.

In at least one of the various embodiments, the NMC may be arranged to identify entities in the network. Entities may include applications, services, programs, processes, network devices, or the like, operating in the monitored network. For example, individual entities may include, web clients, web servers, database clients, database servers, mobile app clients, payment processors, groupware clients, groupware services, or the like. In some cases, multiple entities may co-exist on the same network computer, or cloud compute instance.

In this example, client computer 1002 may be hosting web client 1004 and DNS client 1006. Further, server computer 1008 may be hosting web server 1010, database client 1014, and DNS client 1021. Also, in this example: server computer 1016 may be arranged to host database server 1018 and authorization client 1020; server computer 1022 may be arranged to host authorization server 1024; and server computer 1026 may be arranged to DNS server 1028.

In at least one of the various embodiments, some or all of the applications on a computer may correspond to entities. Generally, applications, services, or the like, that communicate using the network may be identified as entities by an NMC. Accordingly, there may be more than one entity per computer. Some server computers may host many entities. Also, some server computers may be virtualized machine instances executing in a virtualized environment, such as, a cloud-based computing environment. Likewise, one or more servers may running in containerized compute instances, or the like.

In at least one of the various embodiments, an individual process or program running on a network computer may perform more than one type of operation on the network. Accordingly, some processes or programs may be represented as more than one entity. For example, a web server application may have an embedded database client. Thus, in some embodiments, an individual web server application may contribute two or more entities to the device relation model.

In at least one of the various embodiments, the NMC may be arranged to monitor the network traffic to identify the entities and to determine their roles. In at least one of the various embodiments, the NMC may monitor the communication protocols, payloads, ports, source/destination addresses, or the like, or combination thereof, to identify entities.

In at least one of the various embodiments, the NMC may be preloaded with configuration information that it may use to identify entities and the services/roles they may be performing in the network. For example, if an NMC observes a HTTP GET request coming from a computer, it may determine there is a web client entity running on the host. Likewise, if the NMC observes a HTTP 200 OK response originating from a computer it may determine that there is a web server entity in the network.

In at least one of the various embodiments, the NMC may use some or all of the tuple information included in network traffic to distinguish between different entities in the network. Further, the NMC may be arranged to track the connections and network flows established between separate entities by correlating the tuple information of the requests and responses between the entities.

FIG. 11 illustrates a logical representation of a data structure for device relation model 1100 that includes entities in accordance with the one or more embodiments. In at least one of the various embodiments, network monitoring computers (NMCs) may be arranged generate device relation models, such as, device relation model 1100. In this example, device relation model 1100 represents the entities discovered network 1000 shown in FIG. 10.

In at least one of the various embodiments, as described above, NMCs may arrange device relation models to represent the relationship the entities have to each other rather than just modeling the network topology. For example, entity 1106, entity 1110, and entity 1118 are each related to the DNS system in network 1000. Therefore, in this example, for some embodiments, the NMC may arrange device relation model 1100 such that all of the DNS related entities (entity 1106, entity 1110, and entity 1118) are neighbors in the graph. Accordingly, in some embodiments, even though entity 1106 corresponds to DNS client 1006 on client computer 1002, the NMC may group entity 1106 with the other DNS entities rather than put it next other entities in the same computer.

In at least one of the various embodiments, the NMC may be arranged to generate device relation model 1100 based on the relationships that the entities have with each other. Accordingly, in some embodiments, the edges in the graph may be selected or prioritized (e.g., weighted) based on the type or strength of the relationship. In at least one of the various embodiments, the metrics used for prioritizing the edges in a device relation model may be selected/computed based on configuration information that includes rules, conditions, pattern matching, scripts, computer readable instructions, or the like. In some embodiments, NMCs may be arranged to apply this configuration information to the observed network packets (e.g., headers, payloads, or the like) to identify and evaluate relationships.

In at least one of the various embodiments, in device relation model 1100, the edge connecting entity 1104 and entity 1108 is depicted thicker to represent the close relationship the web server entity has with the database client entity. This reflects that the web server may be hosting a data centric web application that fetches data from a database when it receives HTTP requests from clients. Likewise, for device relation model 1100 the relationship between the database client (entity 1108) and the database server (entity 1112) is also a strong relationship. Similarly, the relationship between the authorization client (entity 1114) and the authorization server (entity 1116) is a strong relationship.

Also, in this example, the client (entity 1102) and DNS client 1 (entity 1106) have a strong relationship and it follows that DNS client 1 (entity 1106) has a strong relationship with the DNS server (entity 1118). However, DNS client 2 (entity 1110) has a weak relationship with the DNS server (entity 1118). In this example, this may make sense because DNS client 1 (entity 1106) is often used by the client (entity 1102) to send lookup requests to the DNS server. In contrast, in this example, DNS client 2 (entity 1110) is rarely used since it is running on the server computer (server computer 1008 in FIG. 10) and it may rarely issue name lookup requests.

In at least one of the various embodiments, the NMC may traverse device relation model 1100 to identify entities that may be closely related together and associate them into a group. For example, in some embodiments, in device relation model 1100, entity 1104, entity 1108, and entity 1112 may be grouped since they each have strong relationships with each other.

Accordingly, in at least one of the various embodiments, the NMC may be arranged to correlate error signals that may be associated with one or more entities that are in the same group to determine that an anomaly may be occurring. Related error signals that may propagate through a group of closely related entities may be recognized as a bigger problem that rises to an actual anomaly.

In at least one of the various embodiments, the NMC may be arranged to have configuration information, including, templates, patterns, protocol information, or the like, for identifying error signals in a group that may have correlations that indicate they indicate an anomaly.

In some embodiments, the NMC may be arranged to capture/monitor incoming and outgoing network traffic for entities in a monitored network. Also, the NMC may be arranged to employ various protocol analysis facilities, such as, state machines, mathematical models, or the like, to track expected/normal operations of different types of entities in a monitored network. Accordingly, in at least one of the various embodiments, the NMC may monitor the state of operations for entities that are working together. For example, a web client entity, such as, entity 1102, may make an HTTP request to web server entity 1104, that in turn triggers the web server entity 1104 to issue a database request to DB client entity 1108 that in turn is provided database server entity 1112. In some embodiments, the NMC may monitor the operation of each entity in the group by observing the network traffic exchanged between the entities in a group. Thus, in some embodiments, if an error at database server entity 1112 causes web client entity 1102 to drop its connection because of a timeout (or the user cancel the request, or repeats the same request before the response is sent), the NMC may be able to correlate the error at database server entity 1112 with the “timeout” error at web client entity 1102 to recognize what may be a serious anomaly.

FIG. 12 represents a logical representation of system 1200 for transforming monitored network traffic into activity profile objects in accordance with one or more of the various embodiments. In one or more of the various embodiments, NMC 1202 may be arranged to monitor network traffic 1204. As described, in some embodiments, NMC 1202 may be arranged to provide various metrics associated with monitored network traffic 1204.

In one or more of the various embodiments, an NMC may be arranged to transform one or more collected metrics into activity profiles suitable for training of correlation models. Likewise, in one or more of the various embodiments, activity profiles may be provided to an inference engine to identify one or more correlations between or among one or more activity profiles based on one or more correlation models.

Accordingly, in one or more of the various embodiments, as described above, NMCs such as, NMC 1202 may be arranged to collect metrics, portions of the traffic, traffic attributes, or the like, from monitored network traffic and arrange them into activity profiles. In one or more of the various embodiments, activity profiles may include collections of one or more fields with values that may be based on network traffic 1204 or metrics associated with network traffic 1202. In one or more of the various embodiments, one or more of the metrics included in an activity profile may correspond to metrics collected by the NMC. In other embodiments, one or more of the metrics included in an activity profile may be composed of two or more metrics. Also, in one or more of the various embodiments, one or more metrics or attributes of an activity profile may be computed based on one or more observed metrics.

Further, in one or more of the various embodiments, metric values included in activity profiles may be normalized or fit to a common schema as well as arithmetically normalized. Normalizing metric values to a common schema may include bucketing values. For example, in some embodiments, observed metrics that have continuous values may be mapped to named buckets, such as high, medium, low, or the like.

In one or more of the various embodiments, NMCs may be arranged to execute one or more ingestion rules to perform pre-processing, such as, the data normalization, that may be required to map observed (raw) metrics into activity profile field values. In one or more of the various embodiments, one or more ingestion rules may be built-in to NMCs while other ingestion rules may be provided via configuration information, plug-ins, rule based policies, user input, or the like.

In one or more of the various embodiments, pre-processing may include identifying one or more literal values or parameter values that may be different even though the activity associated with the network traffic may be considered the same or similar. For example, in some embodiments, the monitored network activity may include requests that have the same or similar structure, such as, GET/foo/bar?ID=12234 where the value for the ID parameter may vary per request while the rest of the request remains the unchanged. Accordingly, in this example, in one or more of the various embodiments, an activity profile for the request may include, among other things “Protocol: HTTP; Function: GET; URL: /foo/bar?ID=<integer>,” and so on. Thus, in this example, a request such as GET/foo/bar?ID=5678 may match the profile while a request such as GET/foo/bar?time=00001 may not match the profile.

In one or more of the various embodiments, one or more activity profiles may be associated with computer readable instructions that enforce one or more matching rules or filter rules. Accordingly, in one or more of the various embodiments, inference engines may be arranged to execute the one or more matching rules or filters to determine if observed network activity should be associated with an activity profile. In some embodiments, one or more matching rules may be comprised of pattern matching instructions, such as, regular expressions, or the like. In some embodiments, one or more matching rules may be comprised one or more compound or cascading rules or sub-rules for matching network activity to activity profiles. In some embodiments, the same network activity may match two or more activity profiles.

In one or more of the various embodiments, activity profiles may be employed by inference engines, analysis engines, anomaly engines, or the like, for correlating activity associated with one or more entities with one or more activities that may be associated with one or more other entities.

FIGS. 13A and 13B illustrate a logical representation of system 1300 for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments.

FIG. 13A illustrates a logical representation of system 1300 for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments. In one or more of the various embodiments, one or more NMCs such as NMC 1302, NMC 1304, or the like, may be arranged to monitor network traffic that may be associated with many different devices or entities in one or more networks. In one or more of the various embodiments, NMC 1302 may be arranged to monitor network traffic originating from entity 1306 that travels through network path 1308 to entity 1310. Also, in this example, NMC 1304 may be arranged to monitor the network traffic from entity 1310 that travels through network path 1312 to entity 1314.

Accordingly, in one or more of the various embodiments, NMC 1302 may observe the network traffic associated with activity profile 1320. Likewise, in this example, NMC 1304 may observe network traffic that may be associated with activity profile 1322. Further, in one or more of the various embodiments, the one or more NMCs may also observe that entity 1314 sends requests 1316 to entity 1318.

In one or more of the various embodiments, an inference engine, such as, inference engine 324 may be arranged to determine one or more correlations between activity profile 1320 and activity profile 1322 that results in request 1316 being provided to entity 1318. This inference process is described below in more detail, however, briefly, NMCs may be arranged to pre-process activity traffic into features that comprises activity profiles. Accordingly, the activity profiles may be used to identify similar activity that may occur in the monitored networks. Also, in some embodiments, inference engines may be arranged learn one or more correlations that may be associated with the determined activity profiles. In this example, activity profile 1320 may correspond to a user directing her web browser to a dashboard portion of a web application running on entity 1310 (a web server). In this example, in response to a request to view a user's dashboard, the web application may generate a request that matches profile 1322. In this example, activity profile 1322 is a request for user related information that may be used to populate the individual user's dashboard. In this example, the request associated with activity profile 1322 results in another request which may be a query to entity 1318 (a database server) to collect the information to send back to the entity 1310. Accordingly, in one or more of the various embodiments, the inference engine may determine that activity profile 1320 may be correlated with activity profile 1322.

FIG. 13B illustrates a logical representation of system 1300 for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments. In one or more of the various embodiments, one or more NMCs such as NMC 1302, NMC 1304, or the like, may be arranged to monitor network traffic that may be associated with many different devices or entities in one or more networks. In one or more of the various embodiments, NMC 1302 may be arranged to monitor network traffic originating from entity 1306 that travels through network path 1308 to entity 1310. Also, in this example, NMC 1304 may be arranged to monitor the network traffic from entity 1310 that travels through network path 1312 to entity 1314.

Accordingly, in one or more of the various embodiments, NMC 1302 may observe the network traffic associated with activity profile 1324. Likewise, in this example, NMC 1304 may observe network traffic that may be associated with activity profile 1326. Further, in one or more of the various embodiments, the one or more NMCs may also observe that entity 1314 makes several requests, such as, request 1328 entity 1318.

In one or more of the various embodiments, the inference engine may be arranged to determine a correlation between activity profile 1324 and activity profile 1326 based on the several requests being provided to entity 1318. In this example, activity profile 1324 may correspond to a user directing her web browser to provide a report via a web application running on entity 1310 (a web server). In this example, in response to a request to view the report, the web application may generate a request that matches profile 1326. In this example, activity profile 1326 is a request for a list of information that may be used to populate the report requested by the user. In this example, the request associated with activity profile 1326 results in several additional requests to entity 1318 (a database server) to collect the information for the report. Accordingly, in one or more of the various embodiments, the inference engine may determine that activity profile 1324 may be correlated with activity profile 1326. In this example, the inference engine may be arranged to compare the timing of the chain of requests that results in the many requests associated with activity 1328. In this example, based on the strength of the correlation between activity associated with activity profile 1324 and activity 1328, the inference engine may generate report information the indicates that activity associated with activity profile 1324 may cause the many requests associated with activity 1328.

FIG. 14 illustrates a logical schematic of system 1400 for monitoring network traffic that may be associated with network activity that may be associated with one or more activity profiles. In this example, NMC 1402 may be arranged to monitor network traffic between entity 1404 and entity 1406 to determine profile information 1408 based on one or more requests or responses. In one or more of the various embodiments, system 1400 illustrates that the activity and the associate activity profiles may be observed between or among any entity in the monitored networks as long as an NMC is position or configured to observe the network traffic or metrics that may be associated with the network activity.

In one or more of the various embodiments, some or all of the monitored metrics or some or all of the network traffic associated with network activities may be captured or stored. Accordingly, in one or more of the various embodiments, this data or information may be used in concert with one or more device relation models to generate correlations models that may identify one or more correlations between network activity that may otherwise appear to be independent.

FIG. 15A illustrates a logical schematic of requests and responses that may be associated with an entity in a monitored network in accordance with one or more of the various embodiments. In one or more of the various embodiments, entities, such as, entity 1502, may be associated with a variety of network traffic, such as, incoming request 1504, outgoing response 1506, outgoing request 1508, incoming response 1510, or the like. In one or more of the various embodiments, these requests or responses may include attributes or network traffic/content that may be used by an inference engine to associate them with activity profiles (e.g., request profiles or response profiles) that represent a class or category of network activity that may be associated with entity 1502.

FIG. 15B illustrates a logical schematic of data structure 1512 that may be used to capture information that may be used by an inference engine to determine correlations between different activity profiles in accordance with one or more of the various embodiments. In this example, data structure 1512 include timestamp column 1514 and profile column 1516. In this example, for some embodiments, NMCs may be arranged to capture timestamps that may record when network traffic associated with a given activity profile is observed in the one or more monitored networks. In this example, row 1518 indicates that at timestamp t0, network traffic matching profile A was observed. Likewise, in this example, row 1520 indicates that at timestamp t1, network traffic matching profile B was observed. Accordingly, in one or more of the various embodiments, data structure 1512 enables an inference engine to generate inferences regarding one or more of the relationships or correlations between the activity profiles. For example, an inference engine may be arranged to analyze the correlation of activity profile A to activity profile B by observing or discovering dependencies or relationships associated with the activity profiles. Here, for example, an inference engine may be arranged to measure or analyze the correlation between activity profile A and activity profile B based on the observation that network traffic matching activity profile B may be observed to come after network traffic matching activity profile A. In this example, the strength of the correlation may be computed by determining how often the traffic matching activity profile B comes after traffic that matches activity profile A, or the like.

FIG. 15C illustrates a logical schematic of activity profile correlations that may be discovered by the inference engine based on activity profiles and the network traffic associated with the activity profiles. In some embodiments, one or more inference engines may be arranged to perform statistical analysis to identify correlations or dependencies among two or more activity profiles. In this example, for some embodiments, the information in data structures, such as, data structure 1512 may yield correlation information that supports the inference that network activity associated with activity profile 1522 may trigger network activity associated with activity profile 1524 which may in turn trigger network activity associated with activity profile 1526 and trigger network activity that may match activity profile 1528.

Accordingly, in one or more of the various embodiments, the inference engine may determine the strength of the correlation between activity profile 1522 and activity profile 1528. In one or more of the various embodiments, this may enable downstream network activity that would otherwise appear unassociated with any particular upstream network activity to be identified. In one or more of the various embodiments, this may enable upstream network activity that would otherwise appear unassociated with any particular downstream network activity to be identified based on the strength of the correlation of the activity profiles associated with network activity observed in the network.

Generalized Operations

FIGS. 16-20 represent generalized operations for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments. In one or more of the various embodiments, processes 1600, 1700, 1800, 1900, and 2000 described in conjunction with FIGS. 16-20 may be implemented by or executed by one or more processors on a single network computer (or network monitoring computer), such as network computer 300 of FIG. 3. In other embodiments, these processes, or portions thereof, may be implemented by or executed on a plurality of network computers, such as network computer 300 of FIG. 3. In yet other embodiments, these processes, or portions thereof, may be implemented by or executed on one or more virtualized computers, such as, those in a cloud-based environment. However, embodiments are not so limited and various combinations of network computers, client computers, or the like may be utilized. Further, in one or more of the various embodiments, the processes described in conjunction with FIGS. 16-20 may be used for correlating causes and effects associated with network activity based on network behavior in accordance with at least one of the various embodiments or architectures such as those described in conjunction with FIGS. 4-15. Further, in one or more of the various embodiments, some or all of the actions performed by processes 1600, 1700, 1800, 1900, and 2000 may be executed in part by network monitoring engine 322, inference engine 324, analysis engine 326, anomaly engine 327, or the like, running on one or more processors of one or more network computers.

FIG. 16 illustrates an overview flowchart of process 1600 for correlating causes and effects associated with network activity in accordance with one or more of the various embodiments. After a start block, at block 1602, in one or more of the various embodiments, one or more NMCs may be arranged to collect one or more metrics or other information based on monitoring the network traffic in the monitored networks. As described above, NMCs may be arranged to monitor the network traffic associated with various entities in the monitored networks. In some embodiments, the NMCs may employ some or all of the information collected during monitoring to generate one or more device relation models, activity models, correlation models, or the like.

At block 1604, in one or more of the various embodiments, the one or more NMCs may instantiate one or more inference engines to perform actions to associate monitored network traffic with one or more activity profiles. In one or more of the various embodiments, the inference engines may analyze various inputs, such as user feedback, user activity, customer activity, one or more device relation models, or the like, that may be applied to one or more matching rules to associate network activity with activity profiles. Note, in one or more of the various embodiments, the inference engine may be configured to exclude or include one or more entities or one or more portions of the monitored networks from the activity profile matching process.

In one or more of the various embodiments, the inputs to the inference engine are not limited to metrics or conditions that were directly monitored by the NMC. Accordingly, in one or more of the various embodiments, the inference engine may be provided inputs from other services or applications that may be provided via one or more interfaces or APIs. Likewise, in one or more of the various embodiments, the inference engine may be arranged to actively employ one or more interfaces or APIs to obtain information from one or more other applications or services to employ as inputs.

At block 1606, in one or more of the various embodiments, the one or more NMCs may be arranged to analyze the network activity associated one or more entities in the monitored networks based one or more activity profiles. In one or more of the various embodiments, the results of the analysis may be communicated to one or more services or components that may be arranged to interpret the results to determine one or more responses or actions. In one or more of the various embodiments, one or more alerts or one or more events may be generated in response to particular results. Likewise, in some embodiments, the analysis results may be incorporated into various reports, including visualizations that may be arranged to inform users about activity occurring in the monitored networks, or the like.

In some embodiments, activity profiles that are determined to be highly correlated may be linked such that events associated with an activity profile may be associated with its one or more linked activity profiles. Accordingly, an NMC may be arranged to monitor network activity associated with one activity profile while triggering events associated with one or more activity profiles that may be linked. For example, if activity profile A and activity profile B are correlated, the NMC may be arranged to monitor network traffic associated with activity profile A to determine if events for entities associated with activity profile B should be generated in addition or instead of just generating events for the entities associated with activity profile A.

Thus, in one or more of the various embodiments, the one or more NMCs may be arranged to enforce one or more one or more policies associated with the activity profiles. In some embodiments, enforcement may result in the detection of anomalies that may be reported to one or more other services or facilities. Likewise, in some embodiments, enforcement of policies associated with activity profiles may include triggering or recommending one or more capacity planning actions. For example, the occurrence or discovery of network activity associated with one or more activity profiles may indicate that one or more resources in the network should be increased or decreased.

Further, in some embodiments, analysis of network activity based on activity profiles may be employed to trace one or more transactions that traverse more than one network devices, services, applications, or computers in the monitored networks. For example, in some embodiments, one or more activity profiles may be associated with one or more correlations that may be discovered between two or more requests or responses associated with different network devices, services, applications, or computers. In some embodiments, an activity profile may be provided to characterize one or more requests or responses occurring at different devices as being legs of the same transaction. Next, control may be returned to a calling process.

FIG. 17 illustrates a flowchart of process 1700 for matching network activity with activity profiles for correlating causes and effects associated with the network activity in accordance with one or more of the various embodiments. After a start block, at block 1702, in one or more of the various embodiments, one or more NMCs may be arranged to monitor network traffic associated with one or more network activities in one or more monitored networks.

At block 1704, in one or more of the various embodiments, the one or more NMCs may be arranged to determine one or more network activities based on the monitored network traffic. In one or more of the various embodiments, NMCs may be arranged to select network traffic to associate with network activities. In some cases, a portion of the network traffic associated with an activity may be of interest. For example, in some embodiments, network traffic associated with a request from an external client that is directed to an entity in the monitored network may be considered network activity of interest. Also, in one or more of the various embodiments, just a portion of the network traffic associated with a response may be considered a relevant network activity. In some embodiments, response traffic associated with errors or unexpected results may be network activity considered of interest. For example, in one or more of the various embodiments, network traffic associated with a client's request to stream a movie may be interesting network activity. However, in some embodiments, the two hours of video traffic provided to the client might not be considered interesting network activity. In contrast, if the response to a request to stream video is an error response, it may be considered relevant network activity that may require further analysis.

Accordingly, in one or more of the various embodiments, NMCs may be arranged to apply one or more filters or rules that may determine network activities from the overall monitored traffic. In one or more of the various embodiments, this may improve the performance of the NMC by removing some or much network traffic from consideration with respect to matching traffic with activity profiles. For example, an NMC may be arranged to consider HTTP requests as being of interest. Accordingly, in this example, the NMC may be configured to ignore network traffic that is unassociated with HTTP requests. In some embodiments, other criteria may be used to filter traffic down to network activity of interest, including, source or destination tuple information, application type, or the like. In some embodiments, an NMC may be arranged to perform deep packet inspection to identify network activity of interest from network traffic.

Likewise, in one or more of the various embodiments, the criteria for determining network activity of interest may include or be based on one or more performance metrics, such as, packet size, bit rate of the flow, protocol, encryption status (e.g., state, version, type, cipher, or the like), error rate, device relations (e.g., device relation models), time-of-day, day-of-week, geolocation information, or the like.

Further, in one or more of the various embodiments, as described above, the one or more NMCs may be arranged to monitor network activity in the context of network flows, session flows, related flows, or the like, or combination thereof. For example, in some cases, network activity of interest may be associated with more than one network packet that may comprise particular network flows. Accordingly, in some embodiments, a request may include one or more packets in network flow A and the response may include one or more network packets in a related network flow. Or, in one or more of the various embodiments, network activity may be identified based on a plurality of network packets that are in the same network flow. For example, multiple network packets having various characteristics sent over the same network flow may be required to determine network activity that is of interest. Thus, in some embodiments, the features of one or more network flows may be employed to identify or discover correlations that may be used to define one or more activity profiles. Likewise, in some embodiments, the detection of network flows having one or more features that correspond to various metrics or network characteristics may identify network activity that may be associated with one or more activity profiles.

In at least one of the various embodiments, the metrics or traffic characteristics used for determining network activity that may be of interest may be based on configuration information that includes rules, conditions, pattern matching, scripts, computer readable instructions, or the like. In some embodiments, NMCs may be arranged to apply this configuration information to the observed network packets (e.g., packets, headers, payloads, flows, or the like) to evaluate the network traffic.

At block 1706, in one or more of the various embodiments, the one or more NMCs may be arranged to match the determined network activity with one or more activity profiles. In one or more of the various embodiments, an analysis engine, such as, analysis engine 326 may be arranged to map the network activity to activity profiles. The process for mapping network activity to activity profiles is described in more detail for FIG. 18.

At block 1708, in one or more of the various embodiments, the one or more NMCs may be arranged to analyze the network activity based on the one or more activity profiles and one or more correlation models. In one or more of the various embodiments, analyzing network traffic based on activity profiles rather than just the network traffic enables results that may be associated with classes of network activity rather than being limited to specific instances of network traffic. In some embodiments, network traffic or network activity associated with one or more of the same activity profiles may occur in different places of the monitored networks. Likewise, in some embodiments, activity or traffic associated with one or more of the same activity profiles may be associated with different entities.

In one or more of the various embodiments, NMCs may be arranged to generate one or more metrics that may be associated with one activity profiles. Accordingly, in one or more of the various embodiments, metrics may be provided relative to activity profiles. In some embodiments, this may enable metrics to be associated with classes of network activity. Also, in some embodiments, some activity profiles may multi-step or multi-party activities (e.g., transactions, or the like) activity profile metrics may be used to track metrics based on the occurrence of the all of the steps, events, and so on, that may comprise an activity profile definition.

Further, in one or more of the various embodiments, as described above and in reference FIG. 19, NMCs may generate one or more correlation models that correlate two or more activity profiles with each other. Accordingly, in some embodiments, activity profile metrics may include or be associated with two or more correlated activity profiles.

In one or more of the various embodiments, NMCs may be arranged to execute one or more rules to generate various metrics or results based on observed activity profiles. Also, in some embodiments, one or more NMCs may be arranged to employ adaptive rules that change monitoring actions, notification actions, capture actions, or the like, based on one or more correlation models. In one or more of the various embodiments, NMCs may be provided rules that increase or decrease one or more monitoring parameters based on correlation models. For example, in some embodiments, if a correlation model indicates that activity profile A is correlated with activity profile X, NMCs may be arranged to increase or decrease one or more monitoring parameters related to activity profile X when network activity associated with activity profile A is observed. For example, if activity profile A is associated with a class of edge requests provided by clients and activity profile X is associated with a database deep inside the monitored networks, an appearance of activity profile A at the edge of the monitored networks may trigger the NMCs to increase monitoring at the databases associated with activity profile X before the network traffic interest reaches the databases. In one or more of the various embodiments, this type of action improves the performance of the one or more NMCs by enabling the NMCs to avoid performing the additional monitoring actions, or the like, until the appearance or detection of the pre-cursor activity profiles.

At block 1710, in one or more of the various embodiments, the one or more NMCs may be arranged to provide the results of the analysis. In one or more of the various embodiments, the results may be provided to one or more services or components that may be arranged to provide reports, user notifications, or the like, based on the provided results.

Next, control may be returned to a calling process.

FIG. 18 illustrates a flowchart of process 1800 for providing activity profiles based on network activity in accordance with one or more of the various embodiments. After a start block, at block 1802, in one or more of the various embodiments, one or more NMCs may be arranged to monitor network traffic associated with one or more network activities in one or more monitored networks.

At block 1804, in one or more of the various embodiments, the one or more NMCs may be arranged to determine network traffic that may be associated with one or more network activities. In one or more of the various embodiments, as described above, not all network traffic monitored in the monitored networks may be of interest with respect to activity profiles or correlation models. Accordingly, in one or more of the various embodiments, one or more NMCs may be arranged to filter some or all of the monitored network traffic or otherwise select a portion of the network traffic in the monitored networks associated with network activities that may be of interest.

At block 1806, in one or more of the various embodiments, the one or more NMCs may be arranged to determine one or more features or one or more attributes from the network traffic associated with the determined network activity. In some embodiments, NMCs may be arranged to determine one or more metrics to associate with the network activity. In one or more of the various embodiments, the selected one or more metrics may include one or more metrics collected by the NMC, such as, source, destination, network protocol, application protocol, bit rate, packet size, response latency, or the like, or combination thereof.

In one or more of the various embodiments, the one or more feature values may be arranged into a data structures or records, such as, vectors, lists, arrays, graphs, or the like. In some embodiments, the feature values may be normalized or modified. Likewise, in some embodiments, one or more of the feature values may be associated with discrete categories or otherwise bucketed. For example, in one or more of the various embodiments, one or more features having continuous values may be mapped to discrete values, such as, high, medium, or low.

At block 1808, in one or more of the various embodiments, the one or more NMCs may be arranged to provide one or more activity profiles based on the features or metrics associated with the network activities. In one or more of the various embodiments, the features associated with the network activities may be compared to features associated with the one or more activity profiles. In some embodiments, the comparison may include comparing one or more patterns or masks that correspond to one or more features of the network activity. In some embodiments, a single feature such as, a URL associated with a HTTP request, may be sufficient to map network activity to an activity profile.

For example, in some embodiments, one feature interest may include a URL pattern that includes wildcards or positional parameters that may match identifiers or other query values included in the URL. Next, control may be returned to a calling process.

FIG. 19 illustrates a flowchart of process 1900 for adapting or modifying NMC actions based on correlated activity profiles in accordance with one or more of the various embodiments. After a start block, at block 1902, in one or more of the various embodiments, one or more NMCs may be arranged to monitor network traffic associated with one or more network activities in one or more monitored networks.

At decision block 1904, in one or more of the various embodiments, if network activity associated with one or more activity profiles is observed by the one or more NMCs, control may flow to block 1906; otherwise, control may loopback to block 1902.

At block 1906, in one or more of the various embodiments, the one or more NMCs may be arranged to determine one or more other activity profiles that may be correlated with the one or more activity profiles detected at decision block 1904. As described above and below in reference to FIG. 20, one or more inference engines may be arranged to have access to one or more correlation models that are trained or configured to determine if one or more activity profiles may be correlated with one or more other activity profiles. Accordingly, in one or more of the various embodiments, if network activity associated with one or more activity profiles is observed, the inference engine may provide the one or more activity profiles to the one or more correlation models to determine if there are activity profiles that are correlated with an observed activity profile.

In one or more of the various embodiments, correlation models may be arranged to have thresholds that determine minimum correlation values for indicating if one activity profile is correlated with another activity profile. Alternatively, in one or more of the various embodiments, one or more correlation models may be arranged to report all detected correlations no matter how weak. In such cases, inference engines may be arranged to employ other rules or filters to if or how to classify a given correlation.

At decision block 1908, in one or more of the various embodiments, if one or more correlated activity profiles are determined, control may flow to block 1910; otherwise, control may flow to bock 1912.

At block 1910, in one or more of the various embodiments, the one or more NMCs may be arranged to modify its one or more of its monitoring actions based on the correlated activity profiles. In some embodiments, the one or more NMCs may be arranged to execute one or more rules that trigger additional monitoring actions, capture actions, notification actions, or the like, based the correlated activity profiles. In some embodiments, the rules may incorporate one or more features of the correlated activity profiles when determine what actions to take or modify. For example, in some embodiments, one or more rules may include instructions to ignore correlated activity profiles that have a correlation value below a given threshold values. Likewise, in some embodiments, one or more rules may include instructions to perform action based on one or more feature details of the network traffic or network activity associated with one or more activity profiles. For example, in one or more of the various embodiments, a rule may include instructions to modify monitoring actions if a correlated activity profile is associated with a particular entity in the monitored networks. Additionally, in one or more of the various embodiments, other network activity features, such as, time of day, user, user group, source/destination addresses, specific payload or header content, application, service, communication protocol, encryption state, cipher suites, or the like.

In one or more of the various embodiments, modifying monitoring actions may include, increasing or decreasing monitoring detail, collecting additional metrics, capturing more or less packet data, or the like. For example, in some embodiments, a NMC may be arranged to begin capturing all of the network traffic coming into or out of an entity because of a correlation activity profile that has yet to occur. Thus, in this example, if the correlated activity profile does occur, the NMC will be capturing the relevant network traffic that may be required for additional analysis. Likewise, for example, correlations may trigger one or more monitoring actions to be deactivated for a time period.

At block 1912, in one or more of the various embodiments, the one or more NMCs may collect or generate one or more metrics based on the activity profiles associated with network traffic or network activity in the monitored networks.

Next, control may be returned to a calling process.

FIG. 20 illustrates a flowchart of process 2000 for providing correlation models based on activity profiles and network activity in accordance with one or more of the various embodiments. After a start block, at block 2002, in one or more of the various embodiments, the one or more NMCs may be arranged to provide one or more activity profiles. In one or more of the various embodiments, as described above, NMCs may be arranged to generate one or more activity profiles based on monitoring network activity in the one or more monitored networks. In one or more of the various embodiments, the activity profiles may be classified into various categories and stored on one or more data stores.

In one or more of the various embodiments, the provided activity profiles may be selected from among the entire collection of activity profiles based on queries, filters, or the like. For example, one or more activity profiles may be selected based on their age. For example, one or more activity profiles may be selected if their age is less than a defined threshold value. Also, in some embodiments, the one or more activity profiles may be selected based on a decrease or increase in network activity matching the activity profiles. In general, in some embodiments, NMCs may be arranged to execute one or more rules, or the like, to select which activity profiles to provide. The rules may include one or more conditions or threshold values that may be used to identify the provided activity profiles.

At block 2004, in one or more of the various embodiments, the one or more NMCs may be arranged to provide the activity profile history. In one or more of the various embodiments, NMCs may be arranged to track the amount of times that network activity associated with a given activity profile has been observed in the one or more monitored networks. In some embodiments, this information may include a data sketch of the network activity that was associated with each activity profile. In some embodiments, this may include one or more metrics, such as, time of occurrence, entities in the network associated with the occurrence, duration of the activity, statistical values associated with various metrics (e.g., mean, median, distributions or the like), or the like.

In one or more of the various embodiments, NMCs may be provided (or may generate) captured network traffic associated with the one or more activity profiles. In some embodiments, the captured network traffic may be stored or indexed in other traffic/packet capture data stores. Accordingly, in one or more of the various embodiments, the NMCs may request captured network traffic associated with one or more provided activity profiles. For example, in one or more of the various embodiments, an activity profile may be associated with one or more incoming/outgoing networks, applications, services, ports, protocols, packet header values, packet payload values, or the like, or combination thereof. Accordingly, in this example, NMCs may provide a request (e.g., provide a query) that includes one or more parameter values to select captured network traffic associated with one or more activity profiles.

At block 2006, in one or more of the various embodiments, the one or more NMCs may be arranged to provide one or more candidate correlation models. In one or more of the various embodiments, an inference engine, such as, inference engine 324, or the like, may be arranged to generate the one or more correlation models based on or more features of the provided activity profiles and the activity profile history. In one or more of the various embodiments, the correlation models may be arranged to determine or model one or more correlations between or among the one or more provided activity profiles.

In one or more of the various embodiments, some or all of the features of different activity profiles may be used to determine correlations. For example, in some embodiments, NMCs may be arranged to use select feature that may be common to all activity profiles to provide initial or general correlations. Alternatively, in some embodiments, NMCs may be arranged to use all the features of each activity profile when trying to determine correlations between the activity profiles. Accordingly, in one or more of the various embodiments, the correlation value associated with two or more activity profiles may be a combination of correlation values of the various features used in the correlation models. In some embodiments, the importance or significance of each feature may vary depending on operational needs or expectations associated with the one or more monitored networks. Accordingly, in one or more of the various embodiments, the activity profile features used to generate correlation models may be weighted or the correlation value associated with particular features may be weighted to reflect importance.

In some embodiments, depending on the correlation method being used, one or more weight values may be generated in the normal course of generating the correlation models. In some embodiments, one or more weight values may be modified based on local or operational considerations of the monitored networks. For example, there may be some features are known produce high correlations values that may mask other weaker correlations that are of more interest.

At block 2008, in one or more of the various embodiments, the one or more NMCs may be arranged to obtain feedback associated with the one or more candidate correlation models. In one or more of the various embodiments, NMCs may be arranged to provide interactive reports that enable users to investigate the correlation models. Accordingly, in one or more of the various embodiments, users may be enabled score or rate one or more correlation models.

In some embodiments, the interactive reports may enable users to associate notifications, events, priorities, triggered actions, or the like, with one or more correlation models. Accordingly, in one or more of the various embodiments, NMCs may be arranged to take various actions based on network activity depending on its activity profile or correlated activity profiles. For example, if network activity associated with activity profile A is observed and a correlation model suggests that this may lead to network activity associated activity profile B, the NMCs may be arranged to provide notifications associated activity profile B to one or more users or services.

At block 2010, in one or more of the various embodiments, the one or more NMCs may be arranged to deploy the one or more correlation models for use in the analysis of network activity. In one or more of the various embodiments, one or more correlation models may be deployed or activated for monitoring network traffic in the one or more monitored networks. In some embodiments, users may be enabled to selectively activate or deactivate one or more correlation models. Also, in some embodiments, NMCs may be arranged to provide some or all monitoring information (e.g., metrics, traffic, protocol information, tuple information, or the like) to some or all activated correlation models.

At block 2012, in one or more of the various embodiments, one or more NMCs may be arranged to evaluate the performance of the one or more correlation models. In one or more of the various embodiments, correlation model performance may be monitored in real-time based on network activity in the monitored networks. In some embodiments, correlation model performance may be evaluated based on historical or archived network activity or network activity metrics.

In one or more of the various embodiments, the evaluation may include measuring if the actual occurrence of related activity profiles matches the correlation values associated with the correlation model. In some embodiments, the network activity of monitored networks may change over time. In some embodiments, this may affect the accuracy of one or more correlation models. The correlation value (e.g., the strength of correlation) represented by one or more correlation models may not match the observed occurrences of correlated activity profiles.

For example, in some embodiments, a correlation model may suggest that activity profile A and activity profile B are strongly correlated, such that if network activity associated with activity profile A is observed there may be an 80% probability of network activity associated with activity profile B being observed. However, continuing with this example, if the NMCs observe that network activity associated with activity profile B is observed after network activity associated with activity profile A is observed 45% of the time, the associated correlation models may receive a poor evaluation.

In one or more of the various embodiments, NMCs may be arranged to evaluate the one or more correlation models based on the variance or deviation of predicted activity profiles from observed activity profiles. In one or more of the various embodiments, correlation models that are determined to be out-of-range with respect to the accuracy of their predictions may be evaluated poorly.

In some embodiments, the NMCs may be arranged to tag or flag the one or more correlation models that receive poor evaluations. In some embodiments, correlation models that receive an evaluation score that is less than a defined threshold value may be automatically deactivated. In some embodiments, NMCs may be arranged to decrease or increase the effective correlation value or correlation strength associated with a correlation model based on the evaluation. In some embodiments, this may be considered a temporary adjustment that may remain in effect until the correlation model is re-trained or the evaluation of the correlation model changes.

At block 2014, in one or more of the various embodiments, optionally, the one or more NMCs may be arranged to determine one or more correlation models for re-training. In one or more of the various embodiments, NMCs may be arranged to periodically re-train some or all correlation models. In some embodiments, all correlation models may be automatically selected for re-training. In other embodiments, one or more correlation models may be selected for re-training based on evaluation scores associated with the correlation models.

In one or more of the various embodiments, the period for re-training correlation models may be impacted by other factors, including correlation model priority, correlation model category, or the like. Accordingly, in one or more of the various embodiments, selecting a correlation model for re-training may depend on various characteristics of the network activity or activity profiles associated with the correlation model, such as, entities, services, applications, sources, destinations, users, or the like, or combination thereof. For example, one or more correlation models associated with mission critical entities may be configured to be re-trained more often than one or more correlation models that may be associated with less important entities. As these determinations may be dependent on the operational considerations of the monitored networks, NMCs may be arranged to employ configuration information provided by configuration files, file system policies, pre-fetch policies, built-in defaults, user input, or the like, combination thereof, to determine re-training frequency or re-training sensitivity. Next, control may be returned to a calling process.

It will be understood that each block of the flowchart illustration, and combinations of blocks in the flowchart illustration, can be implemented by computer program instructions. These program instructions may be provided to a processor to produce a machine, such that the instructions, which execute on the processor, create means for implementing the actions specified in the flowchart block or blocks. The computer program instructions may be executed by a processor to cause a series of operational steps to be performed by the processor to produce a computer-implemented process such that the instructions, which execute on the processor to provide steps for implementing the actions specified in the flowchart block or blocks. The computer program instructions may also cause at least some of the operational steps shown in the blocks of the flowchart to be performed in parallel. Moreover, some of the steps may also be performed across more than one processor, such as might arise in a multi-processor computer system. In addition, one or more blocks or combinations of blocks in the flowchart illustration may also be performed concurrently with other blocks or combinations of blocks, or even in a different sequence than illustrated without departing from the scope or spirit of the invention.

Accordingly, blocks of the flowchart illustration support combinations of means for performing the specified actions, combinations of steps for performing the specified actions and program instruction means for performing the specified actions. It will also be understood that each block of the flowchart illustration, and combinations of blocks in the flowchart illustration, can be implemented by special purpose hardware based systems, which perform the specified actions or steps, or combinations of special purpose hardware and computer instructions. The foregoing example should not be construed as limiting or exhaustive, but rather, an illustrative use case to show an implementation of at least one of the various embodiments of the invention.

Further, in one or more embodiments (not shown in the figures), the logic in the illustrative flowcharts may be executed using an embedded logic hardware device instead of a CPU, such as, an Application Specific Integrated Circuit (ASIC), Field Programmable Gate Array (FPGA), Programmable Array Logic (PAL), or the like, or combination thereof. The embedded logic hardware device may directly execute its embedded logic to perform actions. In one or more embodiment, a microcontroller may be arranged to directly execute its own embedded logic to perform actions and access its own internal memory and its own external Input and Output Interfaces (e.g., hardware pins or wireless transceivers) to perform actions, such as System On a Chip (SOC), or the like. 

What is claimed as new and desired to be protected by Letters Patent of the United States is:
 1. A method for monitoring network traffic using one or more network computers, wherein execution of instructions by the one or more network computers perform the method comprising: instantiating a monitoring engine to perform actions, including: monitoring one or more portions of the network traffic that are associated with a plurality of entities in one or more networks to provide one or more metrics; and instantiating an inference engine that performs actions, including: providing one or more activity profiles based on the plurality of entities and the one or more portions of the network traffic, wherein each activity profile includes features based on the one or more metrics, the plurality of entities, or the one or more portions of the network traffic; determining one or more other activity profiles that correlate with the one or more activity profiles based on one or more correlation models; monitoring one or more other portions of the network traffic associated with the one or more other activity profiles, wherein the determination of the one or more other activity profiles occurs separate from the monitoring of the one or more other portions of the network traffic that are associated with the one or more other activity profiles; modifying one or more actions of the monitoring engine based on the one or more other activity profiles; and providing one or more reports based on the one or more portions of the network traffic, the one or more activity profiles, the one or more other portions of the network traffic, or the one or more other activity profiles, wherein the one or more reports are provided to one or more users.
 2. The method of claim 1, wherein modifying the one or more actions of the monitoring engine, further comprises, modifying the monitoring of the network traffic based on the one or more other activity profiles, wherein the modifications include collecting one or more additional metrics associated with the one or more other portions of the network traffic that are associated with the one or more other activity profiles.
 3. The method of claim 1, wherein the inference engine performs further actions, including, associating the one or more portions of the network traffic with the one or more activity profiles based on the one or more portions of the network traffic matching one or more of the features that are associated with the one or more activity profiles.
 4. The method of claim 1, wherein providing the one or more activity profiles, further comprises, matching the one or more portions of the network traffic to the one or more activity profiles based on one or more characteristics of the one or more portions of the network traffic, wherein the one or more characteristics include one or more of network flow features, packet size, bit rate of the network traffic, communication protocol, encryption status, encryption state, encryption version, encryption type, encryption cipher, error rate, device relations based on one or more device relation models, time-of-day, day-of-week, user, user group, source tuple information, destination tuple information, application type, service type, payload or header content, or geolocation information.
 5. The method of claim 1, wherein the inference engine performs further actions, including, training the one or more correlation models based on the network traffic and the one or more activity profiles, wherein the one or more correlation models provide correlations that predict a likelihood of the occurrence of the one or more other portions of network traffic subsequent to the occurrence of the one or more portions of the network traffic based on one or more dependencies associated with the one or more activity profiles and the one or more other activity profiles.
 6. The method of claim 1, wherein the inference engine performs further actions, including: determining a score for each of the one or more correlation models based on the occurrence of the one or more other portions of network traffic subsequent to the one or more portions of the network traffic; determining the one or more correlation models that require re-training based on the determined score having a value that is below a threshold value; and re-training the one or more determined correlation models based on the network traffic.
 7. The method of claim 1, wherein providing the one or more activity profiles, further comprises, providing computer readable instructions that are executed to enforce one or more matching rules or filter rules that are associated with the one or more activity profiles, wherein the one or more matching rules or filter rules include one or more of regular expressions, one or more compound rules, one or more cascading rules, or one or more sub-rules.
 8. The method of claim 1, wherein providing the one or more reports, further comprises, one or more of detecting one or more anomalies associated with the one or more activity profiles, identifying one or more capacity planning actions based on the one or more activity profiles, or tracing one or more transactions based on the one or more activity profiles.
 9. A processor readable non-transitory storage media that includes instructions for monitoring network traffic using one or more network monitoring computers, wherein execution of the instructions by the one or more network computers perform the method comprising: instantiating a monitoring engine to perform actions, including: monitoring one or more portions of the network traffic that are associated with a plurality of entities in one or more networks to provide one or more metrics; and instantiating an inference engine that performs actions, including: providing one or more activity profiles based on the plurality of entities and the one or more portions of the network traffic, wherein each activity profile includes features based on the one or more metrics, the plurality of entities, or the one or more portions of the network traffic; determining one or more other activity profiles that correlate with the one or more activity profiles based on one or more correlation models; monitoring one or more other portions of the network traffic associated with the one or more other activity profiles, wherein the determination of the one or more other activity profiles occurs separate from the monitoring of the one or more other portions of the network traffic that are associated with the one or more other activity profiles; modifying one or more actions of the monitoring engine based on the one or more other activity profiles; and providing one or more reports based on the one or more portions of the network traffic, the one or more activity profiles, the one or more other portions of the network traffic, or the one or more other activity profiles, wherein the one or more reports are provided to one or more users.
 10. The media of claim 9, wherein modifying the one or more actions of the monitoring engine, further comprises, modifying the monitoring of the network traffic based on the one or more other activity profiles, wherein the modifications include collecting one or more additional metrics associated with the one or more other portions of the network traffic that are associated with the one or more other activity profiles.
 11. The media of claim 9, wherein the inference engine performs further actions, including, associating the one or more portions of the network traffic with the one or more activity profiles based on the one or more portions of the network traffic matching one or more of the features that are associated with the one or more activity profiles.
 12. The media of claim 9, wherein providing the one or more activity profiles, further comprises, matching the one or more portions of the network traffic to the one or more activity profiles based on one or more characteristics of the one or more portions of the network traffic, wherein the one or more characteristics include one or more of network flow features, packet size, bit rate of the network traffic, communication protocol, encryption status, encryption state, encryption version, encryption type, encryption cipher, error rate, device relations based on one or more device relation models, time-of-day, day-of-week, user, user group, source tuple information, destination tuple information, application type, service type, payload or header content, or geolocation information.
 13. The media of claim 9, wherein the inference engine performs further actions, including, training the one or more correlation models based on the network traffic and the one or more activity profiles, wherein the one or more correlation models provide correlations that predict a likelihood of the occurrence of the one or more other portions of network traffic subsequent to the occurrence of the one or more portions of the network traffic based on one or more dependencies associated with the one or more activity profiles and the one or more other activity profiles.
 14. The media of claim 9, wherein the inference engine performs further actions, including: determining a score for each of the one or more correlation models based on the occurrence of the one or more other portions of network traffic subsequent to the one or more portions of the network traffic; determining the one or more correlation models that require re-training based on the determined score having a value that is below a threshold value; and re-training the one or more determined correlation models based on the network traffic.
 15. The media of claim 9, wherein providing the one or more activity profiles, further comprises, providing computer readable instructions that are executed to enforce one or more matching rules or filter rules that are associated with the one or more activity profiles, wherein the one or more matching rules or filter rules include one or more of regular expressions, one or more compound rules, one or more cascading rules, or one or more sub-rules.
 16. A system for monitoring network traffic in a network: one or more network computers, comprising: a transceiver that communicates over the network; a memory that stores at least instructions; and one or more processors that execute instructions that perform actions, including: instantiating a monitoring engine to perform actions, including: monitoring one or more portions of the network traffic that are associated with a plurality of entities in one or more networks to provide one or more metrics; and instantiating an inference engine that performs actions, including: providing one or more activity profiles based on the plurality of entities and the one or more portions of the network traffic, wherein each activity profile includes features based on the one or more metrics, the plurality of entities, or the one or more portions of the network traffic; determining one or more other activity profiles that correlate with the one or more activity profiles based on one or more correlation models; monitoring one or more other portions of the network traffic associated with the one or more other activity profiles, wherein the determination of the one or more other activity profiles occurs separate from the monitoring of the one or more other portions of the network traffic that are associated with the one or more other activity profiles; modifying one or more actions of the monitoring engine based on the one or more other activity profiles; and providing one or more reports based on the one or more portions of the network traffic, the one or more activity profiles, the one or more other portions of the network traffic, or the one or more other activity profiles, wherein the one or more reports are provided to one or more users; and one or more client computers, comprising: a transceiver that communicates over the network; a memory that stores at least instructions; and one or more processors that execute instructions that perform actions, including: providing one or more of the one or more portions of the network traffic.
 17. The system of claim 16, wherein modifying the one or more actions of the monitoring engine, further comprises, modifying the monitoring of the network traffic based on the one or more other activity profiles, wherein the modifications include collecting one or more additional metrics associated with the one or more other portions of the network traffic that are associated with the one or more other activity profiles.
 18. The system of claim 16, wherein the inference engine performs further actions, including, associating the one or more portions of the network traffic with the one or more activity profiles based on the one or more portions of the network traffic matching one or more of the features that are associated with the one or more activity profiles.
 19. The system of claim 16, wherein providing the one or more activity profiles, further comprises, matching the one or more portions of the network traffic to the one or more activity profiles based on one or more characteristics of the one or more portions of the network traffic, wherein the one or more characteristics include one or more of network flow features, packet size, bit rate of the network traffic, communication protocol, encryption status, encryption state, encryption version, encryption type, encryption cipher, error rate, device relations based on one or more device relation models, time-of-day, day-of-week, user, user group, source tuple information, destination tuple information, application type, service type, payload or header content, or geolocation information.
 20. The system of claim 16, wherein the inference engine performs further actions, including, training the one or more correlation models based on the network traffic and the one or more activity profiles, wherein the one or more correlation models provide correlations that predict a likelihood of the occurrence of the one or more other portions of network traffic subsequent to the occurrence of the one or more portions of the network traffic based on one or more dependencies associated with the one or more activity profiles and the one or more other activity profiles.
 21. The system of claim 16, wherein the inference engine performs further actions, including: determining a score for each of the one or more correlation models based on the occurrence of the one or more other portions of network traffic subsequent to the one or more portions of the network traffic; determining the one or more correlation models that require re-training based on the determined score having a value that is below a threshold value; and re-training the one or more determined correlation models based on the network traffic.
 22. The system of claim 16, wherein providing the one or more activity profiles, further comprises, providing computer readable instructions that are executed to enforce one or more matching rules or filter rules that are associated with the one or more activity profiles, wherein the one or more matching rules or filter rules include one or more of regular expressions, one or more compound rules, one or more cascading rules, or one or more sub-rules.
 23. The system of claim 16, wherein providing the one or more reports, further comprises, one or more of detecting one or more anomalies associated with the one or more activity profiles, identifying one or more capacity planning actions based on the one or more activity profiles, or tracing one or more transactions based on the one or more activity profiles.
 24. A network computer for monitoring communication over a network between two or more computers, comprising: a transceiver that communicates over the network; a memory that stores at least instructions; and one or more processors that execute instructions that perform actions, including: instantiating a monitoring engine to perform actions, including: monitoring one or more portions of the network traffic that are associated with a plurality of entities in one or more networks to provide one or more metrics; and instantiating an inference engine that performs actions, including: providing one or more activity profiles based on the plurality of entities and the one or more portions of the network traffic, wherein each activity profile includes features based on the one or more metrics, the plurality of entities, or the one or more portions of the network traffic; determining one or more other activity profiles that correlate with the one or more activity profiles based on one or more correlation models; monitoring one or more other portions of the network traffic associated with the one or more other activity profiles, wherein the determination of the one or more other activity profiles occurs separate from the monitoring of the one or more other portions of the network traffic that are associated with the one or more other activity profiles; modifying one or more actions of the monitoring engine based on the one or more other activity profiles; and providing one or more reports based on the one or more portions of the network traffic, the one or more activity profiles, the one or more other portions of the network traffic, or the one or more other activity profiles, wherein the one or more reports are provided to one or more users.
 25. The network computer of claim 24, wherein modifying the one or more actions of the monitoring engine, further comprises, modifying the monitoring of the network traffic based on the one or more other activity profiles, wherein the modifications include collecting one or more additional metrics associated with the one or more other portions of the network traffic that are associated with the one or more other activity profiles.
 26. The network computer of claim 24, wherein the inference engine performs further actions, including, associating the one or more portions of the network traffic with the one or more activity profiles based on the one or more portions of the network traffic matching one or more of the features that are associated with the one or more activity profiles.
 27. The network computer of claim 24, wherein providing the one or more activity profiles, further comprises, matching the one or more portions of the network traffic to the one or more activity profiles based on one or more characteristics of the one or more portions of the network traffic, wherein the one or more characteristics include one or more of network flow features, packet size, bit rate of the network traffic, communication protocol, encryption status, encryption state, encryption version, encryption type, encryption cipher, error rate, device relations based on one or more device relation models, time-of-day, day-of-week, user, user group, source tuple information, destination tuple information, application type, service type, payload or header content, or geolocation information.
 28. The network computer of claim 24, wherein the inference engine performs further actions, including, training the one or more correlation models based on the network traffic and the one or more activity profiles, wherein the one or more correlation models provide correlations that predict a likelihood of the occurrence of the one or more other portions of network traffic subsequent to the occurrence of the one or more portions of the network traffic based on one or more dependencies associated with the one or more activity profiles and the one or more other activity profiles.
 29. The network computer of claim 24, wherein the inference engine performs further actions, including: determining a score for each of the one or more correlation models based on the occurrence of the one or more other portions of network traffic subsequent to the one or more portions of the network traffic; determining the one or more correlation models that require re-training based on the determined score having a value that is below a threshold value; and re-training the one or more determined correlation models based on the network traffic.
 30. The network computer of claim 24, wherein providing the one or more activity profiles, further comprises, providing computer readable instructions that are executed to enforce one or more matching rules or filter rules that are associated with the one or more activity profiles, wherein the one or more matching rules or filter rules include one or more of regular expressions, one or more compound rules, one or more cascading rules, or one or more sub-rules. 